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(57) Abstract: The present invention 
relates to the modification of lignin 
biosynthesis in plants, using the 
nucleotide sequences encoding the 
enzymes 4-coumarate CoA-ligase 
(4CL), cinnamoyl-CoA reductase (CCR), 
cinnamyl alcohol dehydrogenase (CAD) 
of the lignin biosynthetic pathway, from 
ryegrass {Lolium) and fescue {Festuca). 
The present invention also relates to 
regulatory elements, promoters capable 
of causing expression of exogenous 
genes in plants, wherein the regulatory 
elements are from the genes for caffeic 
acid Omethyl transferase (OMT), 4CL, 
CCR or CAD. The invention also relates 
to vectors including the nucleic acids 
and regulatory elements of the invention, 
plant cells, plants, plant seeds and 
other plant parts transformed with the 
regulatory elements, nucleic acids and 
vectors and methods using the nucleic 
acids, regulatory elements and vectors. 
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MODIFICATION OF LIGNIN BIOSYNTHESIS 

The present invention relates to the modification of lignin biosynthesis in 
plants and, more particularly, to enzymes involved in the lignin biosynthetic 
pathway and nucleic acids encoding such enzymes. 

The present invention also relates to a regulatory element and, more 
particularly, to a promoter capable of causing expression of an exogenous 
gene in plant cells, such as a gene encoding an enzyme involved in the lignin 
biosynthetic pathway in plants. 



The invention also relates to vectors including the nucleic acids and 
10 regulatory elements of the invention, plant cells, plants, seeds and other plant 
parts transformed with the regulatory elements, nucleic acids and vectors, and 
methods of using the nucleic acids, regulatory elements and vectors. 

Lignins are complex phenolic polymers that strengthen plant cell walls 
against mechanical and chemical degradation. The process of lignification 

15 typically occurs during secondary thickening of the walls of cells with structural, 
conductive or defensive roles. Three monolignol precursors, sinapyl, coniferyl 
and p-coumaryl alcohol combine by dehydrogenative polymerisation to 
produce respectively the syringyl(S), guaiacyl(G) and hydroxyl(H) subunits of 
the lignin polymer, which can also become linked to cell-wall polysaccharides 

20 through the action of peroxidases and other oxidative isozymes. In grasses, 
biosynthesis of the monolignol precursors is a multistep process beginning 
with the aromatic amino-acids phenylalanine and tyrosine. It is the final two 
reduction/ dehydrogenation steps of the pathway, catalysed by Cinnamoyl CoA 
Reductase (CCR) and Cinnamyl Alcohol Dehydrogenase (CAD) that are 

25 considered to be specific to lignin biosynthesis. The proportions of monolignols 
incorporated into the lignin polymer vary depending on plant species, tissue, 
developmental stage and sub-cellular location. 



Caffeic acid Omethyl transferase (OMT), 4 coumarate CoA-ligase 
(4CL), cinnamoyl-CoA reductase (CCR) and cinnamyl alcohol dehydrogenase 
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(CAD) are key enzymes involved in lignin biosynthesis. 

Worldwide permanent pasture is estimated to cover 70% of 
agriculturally cultivated area. Ryegrasses (Lolium spp.) together with the 
closely related fescues (Festuca spp.) are of significant value in temperate 
5 grasslands. The commercially most important ryegrasses are Italian or annual 
ryegrass (L multiflorum Lam.) and perennial ryegrass (L perenne L). They 
are the key forage species in countries where livestock production is an 
intensive enterprise, such as the Netherlands, United Kingdom and New 
Zealand. The commercially most important fescues are tall fescue (F 
10 anundinacea Schreb.), meadow fescue (F pratensis) and red fescue (F 
rubra). 

Perennial ryegrass (Lolium perenne L) is the major grass species sown 
in temperate dairy pastures in Australia, and the key pasture grass in 
temperate climates throughout the world. A marked decline of the feeding 
15 value of grasses is observed in temperate pastures of Australia during late 
spring and early summer, where the nutritive value of perennial ryegrass 
based pasture is often insufficient to meet the metabolic demands of lactating 
dairy cattle. Perennial ryegrass is also an important turf grass. 

Grass and legume in vitro dry matter digestibility has been negatively 
20 correlated with lignin content. In addition, natural mutants of lignin biosynthetic 
enzymes in maize, sorghum and pearl millet that have higher rumen 
digestibility have been characterised as having lower lignin content and altered 
S/G subunit ratio. Thus, lignification of plant cell walls is the major factor 
identified as responsible for lowering digestibility of forage tissues as they 
25 mature. 

It would be desirable to have methods of altering lignin biosynthesis in 
plants, including grass species such as ryegrasses and fescues, by reducing 
the activity of key biosynthetic enzymes in order to reduce lignin content and/or 
alter lignin composition for enhancing dry matter digesitibility and improving 
30 herbage quality. However, for some applications it may be desirable to 
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enhance lignin biosynthesis to increase lignin content and/or alter lignin 
composition, for example to increase mechanical strength of wood, to increase 
mechanical strength of turf grasses, to reduce plant height and reduce lodging 
or improve disease resistance. 

5 While nucleic acid sequences encoding some of the enzymes involved 

in the lignin biosynthetic pathway have been isolated for certain species of 
plants, there remains a need for materials useful in the modification of lignin 
biosynthesis in plants, particularly grass species such as ryegrasses and 
fescues. 

10 Other phenotypic traits which may be improved by transgenic 

manipulation of plants include disease resistance, mineral content, nutrient 
quality and drought tolerance. 

However, transgenic manipulation of phenotypic traits in plants requires 
the availability of regulatory elements capable of causing the expression of 
15 exogenous genes in plant cells. 

It is an object of the present invention to overcome, or at least alleviate, 
one or more of the difficulties or deficiencies associated with the prior art. 

In one aspect, the present invention provides substantially purified or 
isolated nucleic acids or nucleic acid fragments encoding the following 
20 enzymes from a ryegrass (Lolium) or fescue (Festuca) species: 4 coumarate 
CoA-ligase (4CL), cinnamoyl-CoA reductase (CCR) and cinnamyl alcohol 
dehydrogenase (CAD). 

The ryegrass (Lolium) or fescue (Festuca) species may be of any 
suitable type, including Italian or annual ryegrass, perennial ryegrass, tall 
25 fescue, meadow fescue and red fescue. Preferably the ryegrass or fescue 
species is a ryegrass, more preferably perennial ryegrass (Lolium perenne). 

The nucleic acid or nucleic acid fragment may be of any suitable type 
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and includes DNA (such as cDNA or genomic DNA) and RNA (such as mRNA) 
that is single- or double- stranded, optionally containing synthetic, non-natural 
or altered nucleotide bases, and combinations thereof. 

The term "isolated" means that the material is removed from its original 
5 environment (eg. the natural environment if it is naturally occurring). For 
example, a naturally occurring nucleic acid present in a living plant is not 
isolated, but the same nucleic acid separated from some or all of the 
coexisting materials in the natural system, is isolated. Such nucleic acids 
could be part of a vector and/or such nucleic acids could be part of a 
10 composition, and still be isolated in that such a vector or composition is not 
part of its natural environment. 

In a preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encoding 
4CL includes a nucleotide sequence selected from the group consisting of (a) 
15 sequences shown in Figures 2, 3 and 4 hereto (Sequence ID Nos: 1, 3 and 5; 
respectively) (b) complements of the sequences shown in Figures 2, 3 and 4 
hereto (Sequence ID Nos: 1, 3 and 5, respectively); (c) sequences antisense 
to the sequences recited in (a) and (b); and (d) functionally active fragments 
and variants of the sequences recited in (a), (b) and (c). 

20 In a further preferred embodiment of this aspect of the invention, the 

substantially purified or isolated nucleic acid or nucleic acid fragment encoding 
CCR includes a nucleotide sequence selected from the group consisting of (a) 
the sequence shown in Figure 10 hereto (Sequence ID No: 7); (b) the 
complement of the sequence shown in Figure 10 hereto (Sequence ID No: 7); 

25 (c) sequences antisense to the sequences recited in (a) and (b); and (d) 
functionally active fragments and variants of the sequences recited in (a), (b) 
and (c). 

In a still further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encoding 
30 CAD includes a nucleotide sequence selected from the group consisting of (a) 
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the sequences shown in Figures 13, 14, 26 and 27 hereto (Sequence ID Nos: 
9, 11, 14 and 16, respectively); (b) complements of the sequences shown in 
Figures 13, 14, 26 and 27 hereto (Sequence ID Nos: 9, 11, 14 and 16, 
respectively); (c) sequences antisense to the sequences recited in (a) and (b); 
5 and (d) functionally active fragments and variants of the sequences recited in 
(a), (b) and (c). 

By "functionally active" is meant that the fragment or variant (such as an 
analogue, derivative or mutant) is capable of modifying lignin biosynthesis in a 
plant. Such variants include naturally occurring allelic variants and non- 
10 naturally occurring variants. Additions, deletions, substitutions and 
derivatizations of one or more of the nucleotides are contemplated so long as 
the modifications do not result in loss of functional activity of the fragment or 
variant. Preferably the functionally active fragment or variant has at least 
approximately 80% identity to the relevant part of the above mentioned 
15 sequence, more preferably at least approximately 90% identity, most 
preferably at least approximately 95% identity. Such functionally active 
variants and fragments include, for example, those having nucleic acid 
changes which result in conservative amino acid substitutions of one or more 
residues in the corresponding amino acid sequence. Preferably the fragment 
20 has a size of at least 10 nucleotides, more preferably at least 15 nucleotides, 
most preferably at least 20 nucleotides. 

In a second aspect of the present invention there is provided a vector 
including a nucleic acid or nucleic acid fragment according to the present 
invention. 

25 In a preferred embodiment of this aspect of the invention, the vector 

may include a regulatory element such as a promoter, a nucleic acid or nucleic 
acid fragment according to the present invention and a terminator; said 
regulatory element, nucleic acid or nucleic acid fragment and terminator being 
operatively linked. 

30 By "operatively linked" is meant that said regulatory element is capable 
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of causing expression of said nucleic acid or nucleic acid fragment in a plant 
cell and said terminator is capable of terminating expression of said nucleic 
acid or nucleic acid fragment in a plant cell. Preferably, said regulatory element 
is upstream of said nucleic acid or nucleic acid fragment and said terminator is 
5 downstream of said nucleic acid or nucleic acid fragment. 

The vector may be of any suitable type and may be viral or non-viral. 
The vector may be an expression vector. Such vectors include chromosomal, 
non-chromosomal and synthetic nucleic acid sequences, eg. derivatives of 
plant viruses; bacterial plasmids; derivatives of the Ti plasmid from 
10 Agrobacterium tumefaciens; derivatives of the Ri plasmid from Agrobacterium 
rhizogenes; phage DNA; yeast artificial chromosomes; bacterial artificial 
chromosomes; binary bacterial artificial chromosomes; vectors derived from 
combinations of plasmids and phage DNA. However, any other vector may be 
used as long as it is replicable or integrative or viable in the plant cell. 

15 The regulatory element and terminator may be of any suitable type and 

may be endogenous to the target plant cell or may be exogenous, provided 
that they are functional in the target plant cell. 

Preferably the regulatory element is a promoter. A variety of promoters 
which may be employed in the vectors of the present invention are well known 

20 to those skilled in the art. Factors influencing the choice of promoter include 
the desired tissue specificity of the vector, and whether constitutive or 
inducible expression is desired and the nature of the plant cell to be 
transformed (eg. monocotyledon or dicotyledon). Particularly suitable 
promoters include the Cauliflower Mosaic Virus 35S (CaMV 35S) promoter, the 

25 maize Ubiquitin promoter, the rice Actin promoter, and ryegrass endogenous 
OMT, 4CL, CCR or CAD promoters. 

A variety of terminators which may be employed in the vectors of the 
present invention are also well known to those skilled in the art. The terminator 
may be from the same gene as the promoter sequence or a different gene. 
30 Particularly suitable terminators are polyadenylation signals, such as the 
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CaMV 35S polyA and other terminators from the nopaline synthase (nos) and 
the octopine synthase (ocs) genes. 

The vector, in addition to the regulatory element, the nucleic acid or 
nucleic acid fragment of the present invention and the terminator, may include 
5 further elements necessary for expression of the nucleic acid or nucleic acid 
fragment, in different combinations, for example vector backbone, origin of 
replication (ori), multiple cloning sites, spacer sequences, enhancers, introns 
(such as the maize Ubiquitin Ubi intron), antibiotic resistance genes and other 
selectable marker genes [such as the neomycin phosphotransferase (npt2) 
10 gene, the hygromycin phosphotransferase (hph) gene, the phosphinothricin 
acetyltransf erase (bar or pat) gene], and reporter genes (such as beta- 
glucuronidase (GUS) gene (gusA)]. The vector may also contain a ribosome 
binding site for translation initiation. The vector may also include appropriate 
sequences for amplifying expression. 

15 As an alternative to use of a selectable marker gene to provide a 

phenotypic trait for selection of transformed host cells, the presence of the 
vector in transformed cells may be determined by other techniques well known 
in the art, such as PCR (polymerase chain reaction), Southern blot 
hybridisation analysis, histochemical GUS assays, northern and Western blot 

20 hybridisation analyses. 

Those skilled in the art will appreciate that the various components of 
the vector are operatively linked, so as to result in expression of said nucleic 
acid or nucleic acid fragment. Techniques for operatively linking the 
components of the vector of the present invention are well known to those 
25 skilled in the art. Such techniques include the use of linkers, such as synthetic 
linkers, for example including one or more restriction enzyme sites. 

The vectors of the present invention may be incorporated into a variety 
of plants, including monocotyledons (such as grasses from the genera Lolium, 
Festuca, Paspalum, Pennisetum, Panicum and other forage and turf grasses, 
30 corn, oat, sugarcane, wheat and barley), dicotyledons (such as arabidopsis, 
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tobacco, legumes, alfalfa, oak, eucalyptus, maple, canola, soybean and 
chickpea) and gymnosperms. In a preferred embodiment, the vectors are 
used to transform monocotyledons, preferably grass species such as 
ryegrasses (Lolium species) and fescues (Festuca species), more preferably 
5 perennial ryegrass (Lolium perenne) including forage and turf type cultivars. 

Techniques for incorporating the vectors of the present invention into 
plant cells (for example by transduction, transfection or transformation) are 
well known to those skilled in the art. Such techniques include Agrobacterium 
mediated introduction, electroporation to tissues, cells and protoplasts, 
10 protoplast fusion, injection into reproductive organs, injection into immature 
embryos and high velocity projectile introduction to cells, tissues, calli, 
immature and mature embryos. The choice of technique will depend largely 
on the type of plant to be transformed. 

Cells incorporating the vector of the present invention may be selected, 
15 as described above, and then cultured in an appropriate medium to regenerate 
transformed plants, using techniques well known in the art. The culture 
conditions, such as temperature, pH and the like, will be apparent to the 
person skilled in the art. The resulting plants may be reproduced, either 
sexually or asexually, using methods well known in the art, to produce 
20 successive generations of transformed plants. 

In a further aspect of the present invention there is provided a plant cell, 
plant, plant seed or other plant part, including, eg transformed with, a vector of 
the present invention. 

The plant cell, plant, plant seed or other plant part may be from any 
25 suitable species, including monocotyledons, dicotyledons and gymnosperms. 
In a preferred embodiment the plant cell, plant, plant seed or other plant part 
may be from a monocotyledon, preferably a grass species, more preferably a 
ryegrass (Lolium species) or fescue (Festuca species), even more preferably a 
ryegrass, most preferably perennial ryegrass, including forage- and turf-type 
30 cultivars. 
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The present invention also provides a plant, plant seed or other plant 
part derived from a plant cell of the present invention. 

The present invention also provides a plant, plant seed or other plant 
part derived from a plant of the present invention. 

5 In a further aspect of the present invention there is provided a method of 

modifying lignin biosynthesis in a plant, said method including introducing into 
said plant an effective amount of a nucleic acid or nucleic acid fragment and/or 
a vector according to the present invention. 

By "an effective amount" is meant an amount sufficient to result in an 
10 identifiable phenotypic trait in said plant, or a plant, plant seed or other plant 
part derived therefrom. Such amounts can be readily determined by an 
appropriately skilled person, taking into account the type of plant, the route of 
administration and other relevant factors. Such a person will readily be able to 
determine a suitable amount and method of administration. See, for example, 
15 Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, the entire disclosure of which is incorporated 
herein by reference. 

Using the methods and materials of the present invention, plant lignin 
biosynthesis may be increased, decreased or otherwise modified relative to an 

20 untransformed control plant. It may be increased or otherwise modified, for 
example, by incorporating additional copies of a sense, nucleic acid or nucleic 
acid fragment of the present invention. It may be decreased, for example, by 
incorporating an antisense nucleic acid or nucleic acid fragment of the present 
invention. In addition, the number of copies of genes encoding for different 

25 enzymes in the lignin biosynthetic pathway may be manipulated to modify the 
relative amount of each monolignol synthesized, thereby leading to the 
formation of lignin having altered composition. 

In a still further aspect of the present invention there is provided use of a 
nucleic acid or nucleic acid fragment according to the present invention, and/or 
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nucleotide sequence information thereof, and/or single nucleotide 
polymorphisms thereof, as a molecular genetic marker. 

More particularly, nucleic acids or nucleic acid fragments according to 
the present invention, and/or nucleotide sequence information thereof, and/or 
5 single nucleotide polymorphisms thereof, may be used as a molecular genetic 
marker for qualitative trait loci (QTL) tagging, mapping, DNA fingerprinting and 
in marker assisted selection, and may be used as candidate genes or perfect 
markers, particularly in ryegrasses and fescues. Even more particularly, 
nucleic acids or nucleic acid fragments according to the present invention, 
10 and/or nucleotide sequence information thereof, may be used as molecular 
genetic markers in forage and turf grass improvement, eg. tagging QTLs for 
dry matter digestibility, herbage quality, mechanical stress tolerance, disease 
resistance, insect pest resistance, plant stature and leaf and stem colour. 

In a still further aspect of the present invention there is provided a 
15 substantially purified or isolated polypeptide from a ryegrass (Lolium) or fescue 
(Fustuca) species, selected from the group consisting of the enzymes 4CL, 
CCR and CAD. 

The ryegrass (Lolium) or fescue (Festuca) species may be of any 
suitable type, including Italian or annual ryegrass, perennial ryegrass, tall 
20 fescue, meadow fescue and red fescue. Preferably the species is a ryegrass, 
more preferably perennial ryegrass L perenne). 

In a preferred embodiment of this aspect of the invention, the 
substantially purified or isolated enzyme 4CL includes an amino acid sequence 
selected from the group consisting of sequences shown in Figures 2, 3 and 4 
25 hereto (Sequence ID Nos: 2, 4 and 6, respectively); and functionally active 
fragments and variants thereof. 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated enzyme CCR includes an amino acid 
sequence selected from the group consisting of the sequence shown in Figure 



WO 01/95702 



PCT/AU01/00699 



- 11 - 

10 hereto (Sequence ID No: 8); and functionally active fragments and variants 
thereof. 

In a still further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated enzyme CAD includes an amino acid 
5 sequence selected from the group consisting of the sequence shown in 
Figures 13, 14, 26 and 27 hereto (Sequence ID Nos: 10, 12, 15 and 17, 
respectively); and functionally active fragments and variants thereof. 

By "functionally active" in this context is meant that the fragment or 
variant has one or more of the biological properties of the enzymes 4CL, CCR 

10 and CAD, respectively. Additions, deletions, substitutions and derivatizations 
of one or more of the amino acids are contemplated so long as the 
modifications do not result in loss of functional activity of the fragment or 
variant. Preferably the fragment or variant has at least approximately 60% 
identity to the relevant part of the above mentioned sequence, more preferably 

15 at least approximately 80% identity, most preferably at least approximately 
90% identity. Such functionally active variants and fragments include, for 
example, those having conservative amino acid substitutions of one or more 
residues in the corresponding amino acid sequence. Preferably the fragment 
has a size of at least 10 amino acids, more preferably at least 15 amino acids, 

20 most preferably at least 20 amino acids. 

In a further embodiment of this aspect of the invention, there is provided 
a polypeptide recombinantly produced from a nucleic acid or nucleic acid 
fragment according to the present invention. Techniques for recombinantly 
producing polypeptides are well known to those skilled in the art. 

25 In a still further aspect of the present invention there is provided a lignin 

or modified lignin substantially or partially purified or isolated from a plant, 
plant seed or other plant part of the present invention. 

Such lignins may be modified from naturally occurring lignins in terms of 
the length, the degree of polymerisation (number of units), degree of branching 
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and/or nature of linkages between units. 

In a still further aspect, the present invention provides an isolated 
regulatory element capable of causing expression of an exogenous gene in 
plant cells. Preferably the regulatory element is isolated from a nucleic acid or 
5 nucleic acid fragment encoding OMT, 4CL, CCR or CAD. 

The regulatory element may be a nucleic acid molecule, including DNA 
(such as cDNA or genomic DNA) and RNA (such as mRNA) that is single- or 
double- stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases, and combinations thereof. 

10 Preferably the regulatory element includes a promoter, more preferably 

an Omethy (transferase promoter, even more preferably an O- 
methyltransferase promoter from a ryegrass (Lolium) or fescue (Festuca) 
species, more preferably a ryegrass, most preferably perennial ryegrass 
(Lolium perenne). 

15 In a particularly preferred embodiment of this aspect of the invention, 

the regulatory element includes a promoter from the caffeic acid O 
methyltransferase gene corresponding to the cDNA homologue LpOMTI from 
perennial ryegrass. 

Preferably the regulatory element includes a nucleotide sequence 
20 including the first approximately 4630 nucleotides of the sequence shown in 
Figure 18 hereto (Sequence ID No: 13); or a functionally active fragment or 
variant thereof. 

By "functionally active" in this context is meant that the fragment or 
variant (such as an analogue, derivative or mutant) is capable of causing 
25 expression of a transgene in plant cells. Such variants include naturally 
occurring allelic variants and non-naturally occurring variants. Additions, 
deletions, substitutions and derivatizations of one or more of the nucleotides 
are contemplated so long as the modifications do not result in loss of functional 
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activity of the regulatory element. Preferably the functionally active fragment or 
variant has at least approximately 80% identity to the relevant part of the 
above sequence, more preferably at least approximately 90% identity, most 
preferably at least approximately 95% identity. Preferably the fragment has a 
5 size of at least 1 00 nucleotides, more preferably at least 1 50 nucleotides, most 
preferably at least 200 nucleotides. 

In a particularly preferred embodiment of this aspect of the invention, 
the regulatory element includes a nucleotide sequence selected from the 
group consisting of: 

10 Nucleotides - 4581 to -1 

Nucleotides -4285 to -1 

Nucleotides -4020 to -1 

Nucleotides -2754 to -1 

Nucleotides - 1 810 to -1 
15 Nucleotides -831 to -1 

Nucleotides -560 to -1 

Nucleotides -525 to -1 

Nucleotides -274 to -1 

Nucleotides -21 to -1 

20 of Figure 18 hereto (Sequence ID No: 13); 

or a functionally active fragment or variant thereof. 

In another preferred embodiment the regulatory element includes a 4 
coumarate-CoA ligase promoter, even more preferably a 4 coumarate-CoA 
ligase promoter from a ryegrass (Lolium) or fescue (Festuca) species, more 
25 preferably a ryegrass, most preferably perennial ryegrass (Lolium perenne). 
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In a particularly preferred embodiment of this aspect of the invention, 
the regulatory element includes a promoter from the 4 coumarate-CoA ligase 
gene corresponding to the cDNA homologue Lp4CL2 from perennial ryegrass. 

Preferably the regulatory element includes a nucleotide sequence 
5 including the first approximately 2206 nucleotides of the sequence shown in 
Figure 38 hereto (Sequence ID No: 17); or a functionally active fragment or 
variant thereof. 

By "functionally active" in this context is meant that the fragment or 
variant (such as an analogue, derivative or mutant) is capable of causing 

10 expression of a transgene in plant cells. Such variants include naturally 
occurring allelic variants and non-naturally occurring variants. Additions, 
deletions, substitutions and derivatizations of one or more of the nucleotides 
are contemplated so long as the modifications do not result in loss of functional 
activity of the regulatory element. Preferably the functionally active fragment or 

15 variant has at least approximately 80% identity to the relevant part of the 
above sequence, more preferably at least approximately 90% identity, most 
preferably at least approximately 95% identity. Preferably the fragment has a 
size of at least 100 nucleotides, more preferably at least 150 nucleotides, most 
preferably at least 200 nucleotides. 

20 In a particularly preferred embodiment of this aspect of the invention, 

the regulatory element includes a nucleotide sequence selected from the 
group consisting of: 

Nucleotides - 2206 to -1 
Nucleotides -1546 to -1 
25 Nucleotides -1 1 86 to -1 

Nucleotides -406 to -1 
Nucleotides - 1 66 to -1 
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of Figure 38 hereto (Sequence ID No: 17); 

or a functionally active fragment or variant thereof. 

In another preferred embodiment the regulatory element includes a 
cinnamoyl-CoA reductase promoter, even more preferably a cinnamoyl-CoA 
5 reductase promoter from a ryegrass (Lolium) or fescue (Festuca) species, 
more preferably a ryegrass, most preferably perennial ryegrass (Lolium 
perenne). 

In a particularly preferred embodiment of this aspect of the invention, 
the regulatory element includes a promoter from the cinnamoyl-CoA reductase 
10 gene corresponding to the LpCCRI cDNA from perennial ryegrass. 

Preferably the regulatory element includes a nucleotide sequence 
including the first approximately 6735 nucleotides of the sequence shown in 
Figure 39 hereto (Sequence ID No: 18); or a functionally active fragment or 
variant thereof. 

15 By "functionally active" in this context is meant that the fragment or 

variant (such as an analogue, derivative or mutant) is capable of causing 
expression of a transgene in plant cells. Such variants include naturally 
occurring allelic variants and non-naturally occurring variants. Additions, 
deletions, substitutions and derivatizations of one or more of the nucleotides 

20 are contemplated so long as the modifications do not result in loss of functional 
activity of the regulatory element. Preferably the functionally active fragment or 
variant has at least approximately 80% identity to the relevant part of the 
above sequence, more preferably at least approximately 90% identity, most 
preferably at least approximately 95% identity. Preferably the fragment has a 

25 size of at least 100 nucleotides, more preferably at least 150 nucleotides, most 
preferably at least 200 nucleotides. 
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ln a particularly preferred embodiment of this aspect of the invention, 
the regulatory element includes a nucleotide sequence selected from the 
group consisting of: 

Nucleotides - 6735 to -1 
5 Nucleotides -5955 to -1 

Nucleotides -5415 to -1 

Nucleotides -4455 to -1 

Nucleotides - 4035 to -1 

Nucleotides -3195 to -1 
10 Nucleotides -2595 to -1 

Nucleotides -1755 to -1 

Nucleotides -1275 to -1 

Nucleotides -495 to -1 

Nucleotides -255 to -1 
15 Nucleotides -75 to -1 

of Figure 39 hereto (Sequence ID No: 18); 

or a functionally active fragment or variant thereof. 

By an "exogenous gene" is meant a gene not natively linked to said 
regulatory element. In certain embodiments of the present invention the 
20 exogenous gene is also not natively found in the relevant plant or plant cell. 

The exogenous gene may be of any suitable type. The exogenous gene 
may be a nucleic acid such as DNA (e.g. cDNA or genomic DNA) or RNA (e.g. 
mRNA), and combinations thereof. The exogenous gene may correspond to a 
target gene, for example a gene capable of influencing disease resistance, 
25 herbage digestibility, nutrient quality, mineral content or drought tolerance or 
be a fragment or variant (such as an analogue, derivative or mutant) thereof 
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which is capable of modifying expression of said target gene. Such variants 
include nucleic acid sequences which are antisense to said target gene or an 
analogue, derivative, mutant or fragment thereof. The transgene may code for 
a protein or RNA sequence depending the target condition and whether down 
5 or up-regulation of gene expression is required. Preferably, the target gene is 
selected from exogenous coding sequences coding for mRNA for a protein, 
this protein may be of bacterial origin (such as enzymes involved in cell wall 
modification and cell wall metabolism, cytokinin biosynthesis), or eukaryotic 
origin (such as pharmaceutical^ active polypeptides) or of plant origin (such as 

10 enzymes involved in the synthesis of phenolic compounds, cell wall 
metabolism, sugar metabolism, lignin biosynthesis). Preferably, the target 
gene is selected from the group comprising O-methyltransf erase, 4 coumarate 
CoA-ligase, cinnamoyl CoA reductase, cinnamyl alcohol dehydrogenase, 
cinnamate 4 hydroxylase, phenolase, laccase, peroxidase, coniferol glucosyl 

15 transferase, coniferin beta-glucosidase, phenylalanine ammonia lyase, ferulate 
5-hydroxylase, chitinase, glucanase, isopentenyltransferase, xylanase. 

The plant cells, in which the regulatory element of the present invention 
is capable of causing expression of an exogenous gene, may be of any 
suitable type. The plant cells may be from monocotyledons (such as grasses 

20 from the genera Lolium, Festuca, Paspalum, Pennisetum, Panicum and other 
forage and turf grasses, corn, grains, oat, sugarcane, wheat and barley), 
dicotyledons (such as arabidopsis, tobacco, legumes, alfalfa, oak, eucalyptus 
and maple) and gymnosperms. Preferably the plant cells are from a 
monocotyledon, more preferably a grass species such as a ryegrass (Lolium) 

25 or fescue (Festuca) species, even more preferably a ryegrass, most preferably 
perennial ryegrass (Lolium perenne). 

The regulatory element according to the present invention may be used 
to express exogenous genes to which it is operatively linked in the production 
of transgenic plants. 

30 Accordingly, in a further aspect of the present invention there is 

provided a vector including a regulatory element according to the present 
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invention. 

In a preferred embodiment of this aspect of the invention, the vector 
may include a regulatory element according to the present invention, an 
exogenous gene as hereinbefore described, and a terminator; said regulatory 
5 element, exogenous gene and terminator being operatively linked, such that 
said regulatory element is capable of causing expression of said exogenous 
gene in plant cells. Preferably, said regulatory element is upstream of said 
exogenous gene and said terminator is downstream of said exogenous gene. 

The vector may be of any suitable type and may be viral or non-viral. 

10 The vector may be an expression vector. Such vectors include chromosomal, 
non-chromosomal and synthetic nucleic acid sequences, eg. derivatives of 
plant viruses; bacterial plasmids; derivatives of the Ti plasmid from 
Agrobacterium tumefaciens; derivatives of the Ri plasmid from Agrobacterium 
rhizogenes; phage DNA; yeast artificial chromosomes; bacterial artificial 

15 chromosomes; binary bacterial artificial chromosomes; vectors derived from 
combinations of plasmids and phage DNA. However, any other vector may be 
used as long as it is replicable on integrative or viable in the plant cell. 

The terminator may be of any suitable type and includes for example 
polyadenylation signals, such as the Cauliflower Mosaic Virus 35S polyA 
20 (CaMV 35S polyA) and other terminators from the nopaline synthase (nos) and 
the octopine synthase (ocs) genes. 

The vector, in addition to the regulatory element, the exogenous nucleic 
acid and the terminator, may include further elements necessary for 
expression of the nucleic acid, in different combinations, for example vector 

25 backbone, origin of replication (ori), multiple cloning sites, spacer sequences, 
enhancers, introns (such as the maize Ubiquitin Ubi intron), antibiotic 
resistance genes and other selectable marker genes [such as the neomycin 
phosphotransferase {npt2) gene, the hygromycin phosphotransferase {hph) 
gene, the phosphinothricin acetyltransferase (bar or pat) gene], and reporter 

30 genes (such as beta-glucuronidase (GUS) gene (gusA)]. The vector may also 
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contain a ribosome binding site for translation initiation. The vector may also 
include appropriate sequences for amplifying expression. 

The regulatory element of the present invention may also be used with 
other full promoters or partial promoter elements. 

5 As an alternative to use of a selectable marker gene to provide a 

phenotypic trait for selection of transformed host cells, the presence of the 
vector in transformed cells may be determined by other techniques well known 
in the art, such as PCR (polymerase chain reaction), Southern blot 
hybridisation analysis, histochemical GUS assays, northern and Western blot 
10 hybridisation analyses. 

Those skilled in the art will appreciate that the various components of 
the vector are operatively linked, so as to result in expression of said 
transgene. Techniques for operatively linking the components of the vector of 
the present invention are well known to those skilled in the art. Such 
15 techniques include the use of linkers, such as synthetic linkers, for example 
including one or more restriction sites. 

The vectors of the present invention may be incorporated into a variety 
of plants, including monocotyledons, dicotyledons and gymnosperms. In a 
preferred embodiment the vectors are used to transform monocotyledons, 
20 preferably grass species such as ryegrasses (Lolium species) and fescues 
{Festuca species), more preferably perennial ryegrass (Lolium perenne) 
including forage- and turf- type cultivars. 

Techniques for incorporating the vectors of the present invention into 
plant cells (for example by transduction, transfection or transformation) are 
25 well known to those skilled in the art. Such techniques include Agrobacterium 
mediated introduction, electroporation to tissues, cells and protoplasts, 
protoplast fusion, injection into reproductive organs, injection into immature 
embryos and high velocity projectile introduction to cells, tissues, calli, 
immature and mature embryos. The choice of technique will depend largely 
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on the type of plant to be transformed. 

Cells incorporating the vector of the present invention may be selected, 
as described above, and then cultured in an appropriate medium to regenerate 
transformed plants, using techniques well known in the art. The culture 
5 conditions, such as temperature, pH and the like, will be apparent to the 
person skilled in the art. The resulting plants may be reproduced, either 
sexually or asexually, using methods well known in the art, to produce 
successive generations of transformed plants. 

In a further aspect of the present invention there is provided a plant cell, 
10 plant, plant seed or other plant part, including, eg. transformed with, a vector of 
the present invention. 

The plant cell, plant, plant seed or other plant part may be from any 
suitable species, including monocotyledons, dicotyledons and gymnosperms. 
In a preferred embodiment the plant cell, plant, plant seed or other plant part is 
15 from a monocotyledon, preferably a grass species, more preferably a ryegrass 
(Lolium species) or fescue (Festuca species), even more preferably perennial 
ryegrass (Lolium perenne), including forage- and turf-type cultivars. 

The present invention also provides a plant, plant seed, or other plant 
part derived from a plant cell of the present invention. 

20 The present invention also provides a plant, plant seed or other plant 

part derived from a plant of the present invention. 

In a still further aspect of the present invention there is provided a 
recombinant plant genome including a regulatory element according to the 
present invention. 

25 In a preferred embodiment of this aspect of the invention the 

recombinant plant genome further includes an exogenous gene operatively 
linked to said regulatory element. 
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In a further aspect of the present invention there is provided a method 
for expressing an exogenous gene in plant cells, said method including 
introducing into said plant cells an effective amount of a regulatory element 
and/or a vector according to the present invention. 

5 By "an effective amount" is meant an amount sufficient to result in an 

identifiable phenotypic change in said plant cells or a plant, plant seed or other 
plant part derived therefrom. Such amounts can be readily determined by an 
appropriately skilled person, taking into account the type of plant cell, the route 
of administration and other relevant factors. Such a person will readily be able 
10 to determine a suitable amount and method of administration. See, for 
example, Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, the entire disclosure of which is 
incorporated herein by reference. 

The present invention will now be more fully described with reference to 
15 the accompanying Examples and drawings. It should be understood, however, 
that the description following is illustrative only and should not be taken in any 
way as a restriction on the generality of the invention described above. 

In the Figures 

Figure 1 shows plasmid maps of the three cDNAs encoding perennial 
20 ryegrass 4CL homologues. 

Figure 2 shows the nucleotide (Sequence ID No: 1) and amino acid 
(Sequence ID No: 2) sequences of Lp4CL1. 

Figure 3 shows the nucleotide (Sequence ID No: 3) and amino acid 
(Sequence ID No: 4) sequences of Lp4CL2. 

25 Figure 4 shows the nucleotide (Sequence ID No: 5) and amino acid 

(Sequence ID No: 6) sequences of Lp4CL3. 
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Figure 5 shows amino acid sequence alignment of deduced proteins 
encoded by Lp4CL1 (Sequence ID No: 2), Lp4CL2 (Sequence ID No: 4) and 
Lp4CL3 (Sequence ID No: 6). 

Figure 6 shows northern hybridisation analysis of developing perennial 
5 ryegrass using Lp4CL1, Lp4CL2 and Lp4CL3 as hybridisation probes. SR: 
roots from seedlings (3-5 d post-germination), SS: shoots from seedlings (3-5 
d post-germination), ML: leaves from 12-week-old plants, MS: stems from 12- 
week-old plants. Blots were washed in 0.2 X SSPE, 0.1 % SDS at 65 °C. 
Lp4CL1 , Lp4CL2 and Lp4CL3 do not cross hybridise at this stringency. Sizes 
10 are given in kb. 

Figure 7 shows northern hybridisation analysis showing the time course 
of expression of 4CL mRNA in wounded perennial ryegrass leaves. Sizes are 
given in kb. 

Figure 8 shows genomic Southern hybridisation analysis using Lp4CL1, 
15 Lp4CL2 and Lp4CL3 as hybridisation probes. 10 [xg of digested perennial 
ryegrass genomic DNA or 20 \ig of digested tall fescue genomic DNA were 
separated on a 1 .0 % agarose gel, transferred to Hybond N + membranes and 
then hybridised with 32 P labelled Lp4CL1, Lp4CL2 or Lp4CL3 probes. The 
ryegrass Lp4CL1 , Lp4CL2 and Lp4CL3 genes reveal homologous sequences 
20 in tall fescue and indicate that the ryegrass 4CL genes can be used to isolate 
and to manipulate the expression of the tall fescue (Festuca arundinacea) 4CL 
genes. 

Figure 9 shows restriction map of LpCCRI. An L perenne seedling 
cDNA library constructed in Uni-ZAP™ (Stratagene) was screened in a 
25 solution containing 10xPIPES, 50% deionised formamide and 10% SDS at 
42°C. Filters were washed at room temperature, three times in 0.1% SDS, 2x 
SSPE and then twice in 0.1% SDS, 0.2x SSPE. The location of the probe 
used for northern and Southern hybridisation analyses is indicated by the black 
line labelled LpCCR531. 
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Figure 10 shows the nucleotide (Sequence ID No: 7) and amino acid 
(Sequence ID No: 8) sequences of LpCCRI . 

Figure 1 1 shows Southern hybridisation analysis of DNA from double 
haploid (DH) perennial ryegrass using LpCCFM as hybridisation probe. 10^g 
5 of DH genomic DNA was digested with Dral, BamHI, EcoRI, EcoRV, Hindlll or 
Xbal, separated on a 1% agarose gel and then capillary blotted onto nylon 
membrane (Amersham Hybond-N). The membrane was probed with the 
digoxigenin (DIG) labelled LpCCR531 fragment at 25ng/ml in the hybridisation 
solution. Hybridisation was in 4x SSC, 50% formamide, 0.1% N-Lauroyl- 
10 sarcosine, 0.02% SDS, 2% Blocking solution at 42°C. The membrane was 
washed twice for five minutes in 2x SSC, 0.1% SDS at room temperature, then 
twice for fifteen minutes in 0.5x SSC, 0.1 %SDS at 68°C. Molecular weight was 
determined by comparison to a DIG-labelled marker (Roche Molecular 
Biochemicals). 

15 Figure 12 shows northern hybridisation analysis of RNA samples from 

different organs and developmental stages of perennial ryegrass using 
LpCCRI probe. Roots from seedlings (3-5 d post-germination), shoots from 
seedlings (3-5 d post-germination), roots from seedlings (7-10 d post- 
germination), leaves from seedlings (7-10 d post-germination), roots from 6 

20 and 1 0 week old plants, leaves from 6 and 1 0 week old plants, stems from 6 
and 10 week old plants, whole seedling from 1 1 day old Phalaris and 7 day old 
Festuca. 

Total RNA was isolated using Trizol (GibcoBRL) and 15 \ig was 
separated on a 1 .2% Agarose gel containing 6% formamide and then capillary 

25 blotted onto nylon membrane (Amersham Hybond-N). The membrane was 
stained with 0.2% methylene blue/0.3M sodium acetate to visualise the marker 
and ensure that RNA was evenly loaded. 50 ng LpCCR531 was random- 
labelled with 32 P-dCTP (Amersham Megaprime) and hybridisation conditions 
were 4x SSC, 50% formamide, 0.5% SDS, 5x denhardt solution, 5% dextrane 

30 sulphate, 0.1% Herring sperm DNA at 42°C over-night. The ryegrass LpCCRI 
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gene reveal homologous transcripts in tall fescue and Phalaris, thus indicating 
that the ryegrass CCR gene can be used to manipulate the expression of the 
tall fescue {Festuca arundinacea) and Phalaris CCR endogenous genes. 

Figure 13 shows the nucleotide (Sequence ID No: 9) and amino acid 
5 (Sequence ID No: 10) sequences of LpCADI. 

Figure 14 shows the nucleotide (Sequence ID No: 11) and amino acid 
(Sequence ID No: 12) sequences of LpCAD2. 

Figure 15 shows a plasmid map of a cDNA clone encoding perennial 
ryegrass CAD homologue LpCADI . 

10 Figure 16 shows northern hybridisation analysis of RNA samples from 

different organs and developmental stages of perennial ryegrass using A) 
LpCADI and B) LpCAD2 as hybridisation probes. Roots from seedlings 3-5 d 
post-germination, 7-10 d post-germination, 6 weeks and 10 weeks, Shoots 
from seedlings 3-5 d post-germination and 7-10 d post-germination, Leaves 

15 from 6 week old and 10 week old plants, stem tissue from 6 and 10 week old 
plants. RNA isolated from Phalaris and Festuca 1 1 and 7 day old seedlings. 
The ryegrass CAD genes reveal homologous transcripts in tall fescue and 
Phalaris, thus indicating that the ryegrass CAD gene can be used to 
manipulate the expression of the tall fescue and Phalaris CAD endogenous 

20 genes. 

Figure 17 shows genomic Southern hybridisation analysis. 10 \ig of 
perennial ryegrass genomic DNA digested with a range of restriction enzymes 
was separated on a 0.8% agarose gel, transferred to Hybond N and then 
hybridised with a DIG labelled A) LpCADI, and B) LpCAD2 hybridisation 
25 probe. 

Figure 18 shows the nucleotide sequence of the LpOmtl promoter 
(Sequence ID No: 13). 
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Figure 19 shows a plasmid map of plant transformation vector carrying 
the reporter p-glucuronidase (GUS) gene {gusA) under control of the perennial 
ryegrass LpOmtl promoter. 

Figure 20 (upper image) shows PCR analysis of transgenic tobacco 
5 plants containing the gusA gene under the control of the perennial ryegrass 
LpOMTI promoter (upper figure). PCR reactions using grusy4-specific primers 
were performed. Figure 20 (lower images) show histochemical GUS assays, 
demonstrating xylem-specific gusA expression (A and B) and gusA expression 
in glandular leaf trichomes (C and D) in transgenic tobacco plants containing 
10 the gusA gene under the control of the perennial ryegrass LpOMTI promoter. 

Figure 21 shows the isolation of the LpCCRI genomic clone 1. A) 
Southern hybridization analysis of CCR genomic clone XLp6.1 .1a digested with 
Xba\, Ncol, Sail, Xho\, Xho\/SaH DNA was separated on a 0.8% agarose gel, 
transferred to Hybond N and hybridized with a DIG labelled CCR1 probe. B) 
15 Map showing the genomic gene organisation of LpCCRI clone 1 based on 
sequence results. C) Comparison of plant CCR exon size and number in 
different plant species (Lolium perenne, Lp., Eucalyptus gunni, Eg., Eucalyptus 
saligna, Es., Populus balsamifera, Pb.) 

Figure 22 shows the isolation of the LpCCRI genomic clone 2. A) 
20 Southern hybridization analysis of CCR genomic clone XLp6.1 .1 a digested with 
Xba\, Ncol, Sa/I, Xho\, XhoUSali DNA was separated on a 0.8% agarose gel, 
transferred to Hybond N and hybridized with 200bp of the CCR1 promoter 
(Figure 21 B). B) Map showing the promoter region of LpCCRI clone 2 based 
on sequence results. 

25 Figure 23 shows the isolation of an Lp4CL genomic clone. A) Southern 

hybridisation analysis of 4CL genomic clone XLp4CL2 digested with BamHI, 
Kpnl or Sail. DNA was separated on a 0.8% agarose gel, transferred to 
Hybond N and hybridized with a DIG labelled 4CL1 hybridisation probe. B) 
10fil of a standard PCR reaction using forward and reverse oligonucleotides 
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designed to positions outlined on C). The PCR products were separated on a 
0.8% agarose gel and stained with ethidium bromide. C) Map showing the 
genomic gene organisation of XLp4CL2 based on sequence and PCR results. 

Figure 24 shows the isolation of an Lp4CL genomic clone. A) Southern 
5 hybridisation analysis of 4CL genomic clone A,Lp4CL2 digested with BamHI, 
Kpnl, Sail. DNA was separated on a 0.8% agarose gel, transferred to Hybond 
N and hybridized with a DIG labelled 4CL1 probe. B) Map showing the 
genomic gene organisation of Lp4CL2 clone 1 and the promoter region of 
clone 2. 

10 Figure 25 shows plasmid map of plant transformation vector carrying 

the gusA gene under control of the perennial ryegrass Lp4CL2 promoter 
(Lp4CL2::gusA). 

Figure 26 shows nucleotide (Sequence ID No: 14) and amino acid 
(Sequence ID No: 15) sequences of genomic clone CAD2 cv Barlano (Intron 1 
15 and first 111 bp of the coding region are missing). 

Figure 27 shows nucleotide (Sequence ID No: 16) and amino acid 
(Sequence ID No:15) sequences of coding sequence deduced from genomic 
clone CAD2 cv Barlano (region in bold is missing from the genomic clone). 

Figure 28 shows the isolation of LpCAD2 genomic clone. A) Southern 
20 hybridization analysis of CAD genomic clone XLpCAD2 digested with BamHI, 
EcoRI, Kpnl, Sail or Xbal. DNA was separated on a 0.8% agarose gel, 
transferred to Hybond N and hybridized with a DIG labelled CAD2 
hybridisation probe. B) Map showing the genomic gene organisation of 
XLpCAD2 based on sequence results. 

25 Figure 29 shows A) Sense and antisense Lp4CL1 , Lp4CL2 and Lp4CL3 

transformation vectors under control of the CaMV 35S promoter; B) Sense and 
antisense Lp4CL1 , Lp4CL2 and Lp4CL3 transformation vectors under control 
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of the maize ubiquitin promoter. 

Figure 30 shows A) Sense and antisense LpCCRI transformation 
vectors under control of the CaMV 35S promoter; B) Sense and antisense 
LpCCRI transformation vectors under control of the maize ubiquitin promoter. 

5 Figure 31 shows A) Sense and antisense LpCADI transformation 

vectors under control of the CaMV 35S promoter; B) Sense and antisense 
LpCADI transformation vectors under control of the maize ubiquitin promoter. 

Figure 32 shows molecular analysis of Lp4CL1 -transgenic tobacco. A) 
Plasmid map of transformation vector carrying a chimeric sense Lp4CL1 gene. 
10 B) PCR analysis of independent transgenic tobacco clones using Lp4CL1 
specific primers. C) Southern hybridization analysis of independent transgenic 
tobacco plants using an Lp4CL1 specific probe. D) Northern hybridization 
analysis of independent transgenic tobacco plants using an Lp4CL1 specific 
probe. 

15 Figure 33 shows molecular analysis of LpCCRI -transgenic tobacco. A) 

Plasmid map of transformation vectors carrying a chimeric sense and 
antisense LpCCRI gene. B)> PCR analysis of independent sense transgenic 
tobacco clones using LpCCRI specific primers. 

Figure 34 shows protocol for suspension culture-independent 
20 production of transgenic perennial ryegrass plants. A) Isolated zygotic 
embryos, plated on MSM5 medium, day 0; B) Embryogenic callus formation 
and proliferation, 6-8 weeks after embryo isolation; C) Embryogenic calli 
arranged on high osmotic MSM3Plus medium prior to biolistic transformation; 
D) Histochemical GUS assay showing GUS expressing foci 3-4 days post- 
25 bombardment of chimeric gusA gene; E) Selection of embryogenic calli on 
MSM3 medium containing 100 mg/l paromomycin (Pm), 2 weeks after 
microprojectile bombardment; F) Regeneration of Pm resistant shoots on MSK 
medium containing 100 mg/l Pm, 4 weeks after microprojectile bombardment; 
G) In vitro plant regeneration from PM resistant embryogenic calli, 6 weeks 
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after microprojectile bombardment; H) Transgenic perennial ryegrass plants 28 
weeks after embryo isolation. 

Figure 35 shows molecular analysis of transgenic perennial ryegrass 
plants carrying sense and antisense LpOmtl transgenes. Plasmid maps of 
5 vectors used for the co-transformation of perennial ryegrass embryogenic calli; 
pHP23 carrying a chimeric neomycin phosphotransferase (npt2) selectable 
marker gene; pUbiomtl carrying a maize ubiquitin promoter driven sense 
LpOmtl gene; pUbitmol carrying a maize ubiquitin promoter driven antisense 
LpOmtl gene (top). PCR analysis using /?pf2-specific primers of 5 independent 

10 transgenic perennial ryegrass plants from biolistic transformation with sense 
and antisense LpOmtl vectors (upper centre). Southern hybridization analysis 
with an omfl hybridization probe of 7 independent perennial ryegrass plants 
co-transformed with sense (lanes 1-3) and antisense (lanes 4-7) LpOmtl 
vectors (lower centre left). Southern hybridisation analysis with an npt2 

15 hybridisation probe of independent perennial ryegrass plants (lower centre 
right). Northern hybridisation analysis of perennial ryegrass plants co- 
transformed with antisense LpOmtl vector (bottom). C = negative control 
untransformed perennial ryegrass; P = positive plasmid control. 

Figure 36 shows biochemical analysis of LpOmtl -transgenic perennial 
20 ryegrass. OMT activity of leaf samples from selected independent LpOmtl - 
transgenic perennial ryegrass plants (EII8, EII11, EII14 and EII15) was 
determined and compared to untransformed perennial ryegrass negative 
control plant L. perenne cv. Ellett (wild type). Mean values and standard 
deviations of replicate assays are shown. 

25 Figure 37 shows PCR screening of transgenic ryegrass plants. PCR 

analysis using np£2-specific primers of 8 independent transgenic perennial 
ryegrass plants from biolistic transformation with antisense LpUbi4CL2 vector. 

Figure 38 shows the nucleotide sequence of genomic clone 4CL2 from 
perennial ryegrass (Sequence ID No: 17). 
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Figure 39 shows the nucleotide sequence of genomic clone CCR1 from 
perennial ryegrass (Sequence ID No: 18). 

Figure 40 shows the map location of Lp4CL1, Lp4CL3, LpCADI, 
LpCAD2, LpCCFM , LpOMTI and LpOMT2 (in bold) within the genetic linkage 
5 map of perennial ryegrass. 

EXAMPLE 1 

Isolation and characterisation of three 4-Coumarate CoA-Ligase (4CL) 

cDNAs from Lolium perenne 

Materials and Methods 

10 Plant material 

Plants and embryogenic cell suspensions of perennial ryegrass {Lolium 
perenne L.) cv Ellet and tall fescue (Festuca arundinacea Schreb.) cv Triumph 
were established and maintained as previously described (Heath et al., 1998). 
Wounding experiments were performed with 10-day-old seedlings of perennial 
15 ryegrass (cv Ellet) as previously described (Heath et al., 1998). 

Screening of a cDNA library 

A cDNA library prepared with RNA isolated from perennial ryegrass 
seedlings (Heath et al., 1998) was screened with a [ 32 P]dCTP-labelled rice 
partial 4CL probe. The rice 4CL probe and consisted of a 844 bp 4CL specific 
20 sequence inserted into PUC1 19. This insert has 93 % sequence identity with a 
rice 4CL cDNA sequence (Genbank, L43362, bases 453-1300). cDNA inserts 
were excised and recircularized using the ExAssist helper phage with SOLR 
strain (Stratagene) as described by the manufacturer. 

DNA sequencing 

25 cDNA clones were digested with 8 restriction enzymes (BamH\, EcoR\, 

Kpn\, Not\, Psfl, Saft, Xba\, Xho\) and selected clones were sequenced on both 
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strands by the dideoxy chain termination method using M13 forward and 
reverse primers. For sequencing the internal regions of Lp4CL1 , Lp4CL2 and 
Lp4CL3 synthetic oligonucleotide primers were designed from the DNA 
sequences previously determined. Sequencing was performed using the ABI 
5 dye terminator kit and automatic sequencer. Nucleotide sequences were 
aligned using the SeqEd program (ABI) and further analysis was performed 
using the HIBIO DNASIS vs2 program (Hitachi Software Engineering). 

Genomic DNA blot analysis 

Genomic DNA was isolated from single genotype-derived cell 
10 suspensions of perennial ryegrass and tall fescue according to Lichtenstein 
and Draper (1985). Ten \ig of perennial ryegrass DNA and 20 \ig of tall fescue 
DNA was digested with each of the restriction enzymes HindWl and Xba\, 
separated on 1 % agarose gels, and transferred to Hybond N + membranes 
according to the manufacturer's instructions (Amersham). Probes consisted of 
15 BamHI/Kpnl fragments of Lp4CL1 (1771 bp), Lp4CL2 (2034 bp) or Lp4CL3 
(2080 bp) labelled using the Megaprime labelling kit (Amersham) and 
[ 32 P]dCTP. Hybridization was performed at 65 °C in 5 X SSPE, 5 X Denhardt's 
solution, 0.5 % (w/v) SDS, and 200 fxg/mL denatured herring sperm DNA. 
Membranes were washed three times in 2 X SSPE, 0.1 % SDS for 10 min at 
20 25 °C and then twice in 0.1 X SSPE, 0.1 % SDS for 20 min at 65 °C. 

RNA blot analysis 

Total RNA (10 \ig) was separated on 1.2 % formaldehyde gels and 
transferred to Hybond N (Amersham) membranes according to the 
manufacturers instructions. Membranes were stained with 0.2 % methylene 

25 blue to confirm correct loading and transfer of RNA. Hybridisation was 
performed at 42 °C in 5 X SSPE, 5 X Denhart's solution, 0.5 % SDS, 50 % 
deionized formamide, 200 ptg/mL denatured herring sperm DNA. Preparation 
of probes and washing of membranes was as for DNA blot analysis except for 
the tall fescue Northern blot when the final two washes were performed with 

30 0.1 X SSPE, 0.1 % SDS for 10 min at 42°C. 



WO 01/95702 



PCT/AU01/00699 



-31 - 

Results 

Isolation and sequence analysis of perennial ryegrass 4CL cDNAs 

A cDNA library prepared from RNA extracted from perennial ryegrass 
seedlings was screened with a rice 4CL hybridization probe and ten cDNAs 
5 were isolated from 2 X 10 5 pfu. The cDNAs were characterised by restriction 
analysis with 8 restriction enzymes. All clones were full length (approximately 
2.0-2.2 kb) with poly(A) tails and could be separated into three groups: Lp4CL1 
(four clones) Lp4CL2 (five clones) and Lp4CL3 (one clone). Plasmid maps for 
Lp4CL1, Lp4CL2 and Lp4CL3 are shown (Figure 1). Lp4CL1, Lp4CL2 and 
10 Lp4CL3 were fully sequenced (Figures 2, 3 and 4, respectively). 

Lp4CL1 is 2284 bp long with an open reading frame (ORF) of 1710 bp, 
a 5' noncoding region of 322 bp and a 3' noncoding region of 252 bp including 
a poly(A) tail. Lp4CL2 is 1992 bp long with an ORF of 1668 bp, a 5' noncoding 
region of 61 bp and a 3' noncoding region of 263 bp including a poly(A) tail. 
15 Lp4CL3 is 2038 bp long with an ORF of 1671 bp, a 5* noncoding region of 1 12 
bp and a 3 1 noncoding region of 255 bp including a poly(A) tail. 

Within the coding region, Lp4CL1 has 70 % nucleic acid sequence 
identity with both Lp4CL2 and Lp4CL3, while Lp4CL2 has 79 % sequence 
identity with Lp4CL3. There is little sequence homology in the 3' noncoding 
20 regions between clones (52-55 %). 

Amino acid sequence comparisons 

The putative proteins encoded by the three cDNAs consist of 570 amino 
acids [60290 u (Da)] for Lp4CL1 , 556 amino acids (59238 u) for Lp4CL2 and 
557 amino acids (59735 u) for Lp4CL3. The deduced amino acid sequences of 
25 Lp4CL1, Lp4CL2 and Lp4CL3 are shown (Figure 5). Lp4CL2 and Lp4CL3 
share 79 % amino acid sequence identity, Lp4CL1 and Lp4CL2 have 61 % 
amino acid sequence identity, while Lp4CL1 and Lp4CL3 have only 58 % 
amino acid sequence identity. Regions of high sequence homology are more 
prevalent in the central and c-terminal regions of the enzyme. For example the 
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sequence identity between amino acids 208 to 568 of each enzyme is 85 % for 
Lp4CL2 and Lp4CL3, 72 % for Lp4CL1 and Lp4CL2 and 67 % for Lp4CL1 and 
Lp4CL3. 

Lp4CL1 , Lp4CL2 and Lp4CL3 share several common regions with other 
5 plant 4CLs. In particular, they contain the putative AMP-binding domain and 
the conserved GEICIRG motif, except for Lp4CL3 where the second isoleucine 
has been replaced with valine (Figure 5). It has been proposed that domain II 
is associated with the catalytic activity of 4CL. Also, four Cys residues 
conserved in plant 4CLs are conserved in Lp4CL1, Lp4CL2 and Lp4CL3 
10 (Figure 5). These results suggest that the L perenne cDNAs encode three 
divergent 4CL enzymes that are likely to have originated from three different 
4CL genes. 

Expression of perennial ryegrass 4CL genes 

Lp4CL1, Lp4CL2 and Lp4CL3 were used as hybridization probes in 
15 Northern blots with RNA prepared from different organs of perennial ryegrass 
at two developmental stages. All three probes hybridized to a single mRNA 
species of approximately 2.2 - 2.3 kb. Lp4CL1, Lp4CL2 and Lp4CL3 were 
expressed at both seedling and mature stages of development and in all 
organs tested. For Lp4CL2 and Lp4CL3 the strongest signal was found in RNA 
20 samples from seedling roots and mature stems (Figure 6). 

Lp4CL1 , Lp4CL2 and Lp4CL3 were also used as hybridization probes in 
Northern blots with RNA prepared from tall fescue. All three probes hybridized 
to a similar mRNA species (2.3 kb) as that in perennial ryegrass (Figure 6). 
The strongest signal was found in RNA samples from mature stems with 
25 weaker signals in RNA from roots and seedling shoots. No expression of 
Lp4CL1, Lp4CL2 or Lp4CL3 was observed in leaves. The three probes varied 
in their ability to hybridize to the corresponding homologues in tall fescue, with 
Lp4CL3 resulting in the highest signal and Lp4CL1 hybridizing only weakly. 



To determine whether 4CL could be induced under stress conditions, 
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leaves of perennial ryegrass seedlings were wounded. No increase in the 
transcript level upon wounding was observed with Lp4CL1 , Lp4CL2 or Lp4CL3 
(Figure 7). 

Genomic organization of perennial ryegrass 4CL genes 

5 Perennial ryegrass DNA was digested with two restriction enzymes, 

Hind\\\ or Xba\. Restriction sites for these enzymes are not present in the 
cDNA sequence of Lp4CL1, Lp4CL2 or Lp4CL3. When Lp4CL1, Lp4CL2 or 
Lp4CL3 was used as a probe, several DNA hybridizing fragments of varying 
intensity were revealed (Figure 8). Each probe hybridized to a unique set of 

10 fragments, suggesting that Lp4CL1, Lp4CL2 and Lp4CL3 represent three 
different genes. Furthermore, Lp4CL1 and Lp4CL2 hybridized to 2 to 3 major 
fragments per digest which may represent either alleles of the same gene or 
indicate the presence of more than one gene in each class. The Lp4CL1, 
Lp4CL2 and Lp4CL3 probes also revealed several different size hybridizing 

15 DNA fragments in genomic Southern blots from tall fescue under high 
stringency conditions (Figure 8), suggesting that three similar 4CL genes are 
present in F. arundinacea. 

EXAMPLE 2 

Isolation and characterisation of a Cinnamoyl CoA Reductase (CCR) 
20 cDNA from Lolium perenne 

A total of 500,000 phage were screened from a cDNA library 
constructed from ten-day-old etiolated L perenne seedlings using a maize 
CCR probe. Ninety-three positive plaques were observed in the primary 
screen and five were subsequently analysed by restriction enzyme digestion. 
25 Four out of the five were identical. One of the four identical cDNAs, LpCCRI , 
was selected for further analysis (Figure 9). 

Nucleic acid sequence analysis of perennial ryegrass CCR cDNA 

The full nucleotide sequence of LpCCRI was obtained and the amino 
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acid sequence predicted (Figure 10). LpCCRI is a 1395 bp cDNA with 149 bp 
of 5' non-coding region and 160 bp of 3' non-coding region. An open reading 
frame of 1086 bp encodes a protein of 362 amino acids. The composition of 
the coding region was found to be 68% G+C rich. Codon usage was also 
5 examined and found to be biased towards XXC/G codons (94%), with XCG 
and XUA codons accounting for only 9% and 0.55% respectively. G+C 
richness and bias towards G and C in the third position of a codon triplet are 
previously reported characteristics of monocot genes. 

i 

Genomic organization of perennial ryegrass CCR gene 

10 The number of CCR genes present in the ryegrass genome was 

determined by Southern blot analysis of genomic DNA from double haploid 
plants, using as probe a fragment of the LpCCRI cDNA (LpCCR531, Figure 
9). Double haploid DNA reduces the complexity associated with allelic 
variation. Genomic DNA was cut with enzymes that do not cut the cDNA 

15 internally; Dra\, BamH\, EcoRI, EcoRV, H/ndlll and Xba\, and the membrane 
was hybridised and washed under medium-stringency conditions. A single 
strongly hybridising band was evident in each lane (Figure 11) indicating that 
there is a single copy of the LpCCRI gene in the perennial ryegrass genome. 

Expression of perennial ryegrass CCR gene 

20 To investigate the expression profile of the CCR gene in ryegrass, 

northern hybridisation analysis was carried out with total RNA extracted from 
roots and shoots at seedling growth stages (0.5-1 cm and 4-6cm shoots) and 
roots, stem and leaves at mature growth stages (6 and 10 weeks). Seedlings 
were grown on filter paper in the dark at 25°C and then transferred to soil and 

25 glasshouse conditions (25°C) until the 6 and 10-week stages. Whole seedling 
total RNA from Festuca and Phalaris was included in the northern analysis. 
Hybridisation with LpCCR531 (Figure 9) was performed at medium-stringency 
and the membrane was then washed at high-stringency. A transcript of 
approximately 1.5 kb was detected in all tissues, the level of expression 

30 varying with maturity and from one tissue type to another (Figure 12). The 
LpCCRI transcript appears to be more abundant in roots and stem than 



WO 01/95702 



PCT/AU01/00699 



-35- 

shoots and leaves. In the stem, transcript abundance increases from 6-weeks 
to 10-weeks; indicating that transcription in stem tissue is up-regulated as the 
plant matures. Expression was found predominantly in tissues such as stems 
and roots that are forming secondary cell walls indicating that LpCCRI is 
5 constitutively involved in lignification. 

EXAMPLE 3 

Isolation and characterisation of Cinnamyl Alcohol Dehydrogenase 
(CAD) cDNAs from Lolium perenne 

A 558 bp cinnamyl alcohol dehydrogenase (CAD) fragment was 
10 amplified from cDNA synthesised from total RNA prepared from perennial 
ryegrass seedlings. The conserved amino acid domains between Pinus 
radiata, Medicago sativa, Aralia cordata, Eucalyptus botryoides and 
Arabidopsis thaliana CADs were used to design oligonucleotides for the 
amplification of the perennial ryegrass CAD. The forward oligonucleotide was 
15 designed to the conserved amino acid domain CAGVTVYS and the reverse 
oligonucleotide to the conserved domain DVRYRFV. The 551 bp PCR 
fragment was cloned and sequenced to confirm that it corresponded to a 
perennial ryegrass CAD PCR fragment. A cDNA library prepared from RNA 
extracted from perennial ryegrass seedlings was screened with the 551 bp 
20 PCR fragment specific for perennial ryegrass CAD. Eight cDNAs were isolated 
and separated into six groups by restriction digest analysis. One 
representative clone each from two groups (LpCADI , LpCAD2) were selected 
for further characterisation. 

Nucleic acid sequence analysis of perennial ryegrass CAD cDNAs 

25 The complete sequence of the perennial ryegrass CAD homologue 

LpCADI was determined (Figure 13). The 1325 bp clone had a poly (A) tail, 
typical start and stop codons and the open reading frame (ORF) of this clone 
coded for a putative protein of 408 amino acids. 
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The complete nucleotide sequence of the perennial ryegrass CAD 
homologue LpCAD2 was also determined (Figure 14). 

Expression of perennial ryegrass CAD genes 

A northern hybridisation analysis with RNA samples isolated from 
5 perennial ryegrass at different developmental stages hybridised with the full 
length LpCADI 1325 bp cDNA (Figure 15) was performed to determine 
patterns of organ and developmental expression. The probe hybridised to a 
single mRNA species of approximately 1.6 kb. The LpCADI transcript was 
expressed in all tissue tested: roots, shoots, stem and leaves (Figure 16A). 
10 The LpCADI transcript was most abundant in root tissue and the mature stem, 
this expression pattern is typical of a gene involved in the lignification of plant 
cell walls. Intergeneric homologies were revealed in Festuca and Phalaris. 

A similar northern hybridisation analysis was performed with LpCAD2 
(Figure 16B), however the transcript was found to be most abundant in mature 
15 stem tissue and the shoots. 

Genomic organization of perennial ryegrass CAD genes 

A Southern hybridisation analysis using DNA samples isolated from a 
perennial ryegrass double haploid plant digested with Dra\, BamHl, EcoRI, 
EcoRV, HindlH and Xba\ and hybridised with a 500 bp LpCADI probe was 
20 performed. The hybridisation pattern at high stringency revealed the presence 
of two prominent bands for most digests indicating that LpCADI belongs to a 
small gene family and exists a muliticopy gene in perennial ryegrass (Figure 
17A). 

A similar Southern hybridization analysis was performed with LpCAD2 
25 (Figure 17B) the hybridisation pattern at high stringency revealed the presence 
of one or two prominent bands for most digests indicating that LpCAD2 exists 
as a single copy gene or a member of a small gene family in perennial 
ryegrass (Figure 17B). 
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EXAMPLE 4 

Isolation and characterisation of genomic clones and promoters for O- 
methyltransferase (OMT), cinnamoyl-CoA reductase (CCR), 4 coumarate 
CoA-ligase (4CL) and cinnamyl alcohol dehydrogenase (CAD) from 
5 Lolium perenne 

Genomic clones and promoters of O-methyltransferase (OMT), 
cinnamoyl-CoA reductase (CCR), 4 coumarate CoA-ligase (4CL) and cinnamyl 
alcohol dehydrogenase (CAD) were isolated from a perennial ryegrass 
genomic library using the corresponding cDNAs as hybridisation probes. 

10 Isolation and characterisation of genomic clones and promoters for 
perennial ryegrass O-methyltransferase (OMT) 

A perennial ryegrass genomic library was screened with the cDNA 
clone, LpOmtl, (Heath et al. 1998) encoding O-methyltransferase (OMT). The 
sequence of the 5' untranslated region and the coding region was found to be 
15 identical to that of the LpOmtl cDNA previously isolated. The entire 4.8 kb 
genomic clone was fully sequenced (Figure 18). 

To further characterise the promoters, transcriptional fusions of the 
promoter sequence to the ^-glucuronidase (GUS) coding sequence (gusA) 
have been generated (Figure 19). Direct gene transfer experiments to tobacco 

20 protoplasts were performed with the corresponding chimeric genes to 
transgenically express them in a heterologous system for in planta expression 
pattern analysis by histochemical GUS assays. A set of transgenic tobacco 
plants carrying a chimeric gusA gene under the control of the 5' regulatory 
region of the LpOmtl promoter was generated to assess the potential use of 

25 the LpOmtl promoter for xylem-specificity and targeted downregulation of 
genes encoding key lignin biosynthetic enzymes. 

The transgenic tobacco plants generated using the LpOmtl promoter 
driven chimeric gusA transformation vector were screened by PCR and 
histochemical GUS assays. 
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A PCR screening was undertaken using gusA specific primers for the 
initial identification of transgenic tobacco plants (Figure 20). PCR positive . 
tobacco plants were screened by histochemical GUS assays for in planta 
expression pattern analysis (Figure 20). 

5 Isolation and characterisation of genomic clones and promoters for 
perennial ryegrass cinnamoyl-CoA reductase (CCR) 

A CCR genomic clone from perennial ryegrass was isolated containing 
6.5 kb of promoter and the entire gene organisation (intron/exon boundaries). 
The CCR promoter can be used for targeted expression of foreign genes in 
10 transgenic plants. 

A perennial ryegrass genomic library was screened with the cDNA clone 
LpCCRI which codes for the lignin biosynthetic enzyme, cinnamyl-CoA 
reductase (CCR). Four different genomic clones were identified based on 
restriction digest analysis. Clone 6.1.1a was selected for further analysis. A 

15 6.42 kb Xho\ fragment from clone 6.1.1a, which hybridized strongly to the 
LpCCRI cDNA probe, was subcloned into pBluescriptSK (Figure 21 A). 
Sequence analysis revealed that the 6.42 kb Xho\ fragment contained the 
entire LpCCRI gene and 200 bp of promoter region. The intron/exon 
boundaries are illustrated in figure 21 B, the location and the size of the exons 

20 appear to be conserved in other CCRs from different species (Figure 21 C). 

To isolate the promoter region of LpCCRI, the Southern blot containing 
digested phage genomic DNA isolated from clone XLp6.1.1a was reprobed 
with the 200bp promoter region. The probe hybridized strongly to a 6.5 kb Sa/I 
fragment. This genomic fragment LpCCRI clone 2, was subcloned into 
25 pBluescriptSK and sequenced (Figure 22A). Sequence results revealed that 
the 6.5 kb Sa/I fragment contained 6.5 kb of promoter (Figure 22B). The full 
sequence of LpCCRI genomic clone containing the promoter and entire gene 
sequence (exons and introns) was obtained and is shown on Figure 39. 
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Isolation and characterisation of genomic clones and promoters for 
perennial ryegrass 4 coumarate CoA-ligase (4CL) 

A 4CL2 genomic clone from perennial ryegrass was isolated containing 
2.5 kb of promoter and partial gene organisation (intron/exon boundaries). The 
5 4CL2 promoter can be used for targeted expression of foreign genes in 
transgenic plants. The 2.5 kb promoter has been fused to the reporter gene 
gusA for expression analysis. 

A perennial ryegrass genomic library was screened with an Lp4CL 
cDNA probe. After tertiary screening positive 4CL genomic clones were 
10 obtained and characterised by restriction digest and Southern hybridisation 
analysis (Figure 23A). 

Sequence analysis revealed that the isolated 4CL genomic clone 
(4CL2) from perennial ryegrass had 100% nucleotide identity to the Lp4CL2 
cDNA clone. To further characterise this 5 kb A,Lp4CL2 genomic clone and to 

15 confirm that it corresponds to the cDNA of Lp4CL2, a number of PCR 
reactions using primers designed to the cDNA were used. PCR results 
confirmed that the 5 kb genomic fragment was a partial genomic clone 
corresponding to the Lp4CL2 cDNA (Figure 23B). Using primer combinations 
F1 and R1 the entire 4.8kb genomic fragment was amplified. To determine the 

20 location of introns additional PCR reactions using the primer combinations F1 / 
R2 and F2 / R1 were performed, a 1 kb and 3.5 kb bands were amplified 
respectively. The location and size of the introns could be determined from 
these results, and further confirmed by sequence analysis. This large 5 kb 
genomic fragment contains 4 small exons representing the coding sequence of 

25 Lp4CL2 between 508 bp and 1490 bp (Figure 23C). 

The genomic clone 1 , Lp4CL2 contained no promoter region. To isolate 
the promoter region of Lp4CL2, the Southern blot containing digested phage 
genomic DNA isolated from clone XLp4CL2 was reprobed with a 300 bp 
EcoRI/Bgll isolated from the 5* end of the cDNA clone Lp4CL2. The 300 bp 
30 probe hybridised strongly to a 2.5 kb BamHI fragment. This genomic fragment 
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Lp4CL2 clone 2, was subcloned into pBluescriptSK and sequenced (Figure 
24A). Sequence results revealed that the 2.5 kb BamH\ fragment contained 
the 508 bp of the 5' ORF of Lp4CL2 missing from genomic clone 1 and 2.0 kb 
of promoter region (Figure 24B). The full sequence of the Lp4CL2 genomic 
5 clone containing the promoter and partial gene sequence (exons and introns) 
was obtained and is shown on Figure 39. 

The promoter from Lp4CL2 was thus isolated and used for the 
production of a chimeric gusA reporter gene (Figure 25). 

Isolation and characterisation of genomic clones and promoters for 
10 perennial ryegrass cinnamyl alcohol dehydrogenase (CAD) 

A CAD genomic clone from perennial ryegrass was isolated containing 
the gene organisation (intron/exon boundaries) minus intron 1 containing the 
first 111 bp of the CAD coding region. The genomic clone has allowed the 
identification of a G at position 851 bp in the coding region of the CAD2 

15 genomic clone isolated from perennial ryegrass cv. Barlano which is absent in 
the CAD2 cDNA clone isolated from perennial ryegrass cv. Ellett. The SNP 
(single nucleotide polymorphism) found to exist between the 2 cultivars has the 
potential utility as a molecular marker for herbage quality, dry matter 
digestibility, mechanical stress tolerance, disease resistance, insect pest 

20 resistance, plant stature and leaf and stem colour. 

Results below show the isolation of the genomic clone and sequence 
analysis of deduced coding sequence from the genomic clone CAD2 from 
perennial ryegrass cv. Barlano compared to the truncated cDNA CAD2 from 
the cv Ellett. The missing G in the perennial ryegrass cv. Ellett has been 
25 highlighted (Figures 26 and 27). 

A perennial ryegrass genomic library was screened with a probe 
corresponding to the 5' end of the LpCAD2 cDNA clone, which codes for the 
lignin biosynthetic enzyme cinnamyl alcohol dehydrogenase. Ten positive 
plaques were identified and isolated in the primary library screening. After a 
30 secondary and tertiary screening, two positive plaques were obtained and 
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corresponding positive genomic clones were further characterised by 
restriction digest and Southern hybridization analyses. Both genomic clones 
were found to be identical based on restriction digest analyses. One clone, 
named A,Z_pCAD2 was chosen for further Southern hybridization analyses. A 
5 4.5 kb BamH\ fragment which hybridized strongly to the LpCAD2 cDNA probe 
was subcloned into pBluescriptSK and sequenced (Figure 28A). Sequence 
analysis revealed that the 4.5 kb BamH\ fragment was a partial genomic clone 
of LpCAD2. This large 4.5 kb genomic fragment contains 4 small exons 
representing the coding sequence of Z_pCAD2 between 213 bp and the stop 
10 codon at 1213 bp, and the location of the intron/exon boundaries are illustrated 
in Figure 28B. 

EXAMPLE 5 

Development of transformation vectors containing chimeric genes with 
4CL, CCR and CAD cDNA sequences from perennial ryegrass 

15 To alter the expression of the key enzymes involved in lignin 

biosynthesis 4CL, CCR and CAD, through antisense and/or sense suppression 
technology and for over-expression of these key enzymes in transgenic plants, 
a set of sense and antisense transformation vectors was produced. 
Transformation vectors containing chimeric genes using perennial ryegrass 

20 4CL, CCR and CAD cDNAs in sense and antisense orientations under the 
control of either the CaMV 35S or the maize ubiquitin promoter were 
generated (Figures 29, 30 and 31). 

EXAMPLE 6 

Production and characterisation of transgenic tobacco plants expressing 
25 chimeric 4CL, CCR and CAD genes from perennial ryegrass 

A set of transgenic tobacco plants carrying chimeric 4CL, CCR and 
CAD genes from perennial ryegrass were produced and analysed. 
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Transformation vectors with Lp4CL1, Lp4CL2 and Lp4CL3 full length 
cDNA sequences in sense and antisense orientations under the control of 
either the CaMV 35S or the maize ubiquitin promoters were generated. 
Transformation vectors with LpCCRI cDNA in both sense and antisense 
5 orientation under the control of either the CaMV 35S and maize ubiquitin 
promoters were generated. Transformation vectors with 1325 bp full length 
LpCADI cDNA in sense and 1051 bp partial LpCADI cDNA in antisense 
orientation under the control of either the CaMV 35S and maize ubiquitin 
promoters were generated. 

10 Direct gene transfer experiments to tobacco protoplasts were performed 

using these transformation vectors. 

The production and molecular analysis of transgenic tobacco plants 
carrying the perennial ryegrass Lp4CL1 and LpCCRI cDNAs under the control 
of the constitutive CaMV 35S promoter is described here in detail. 

15 A set of transgenic tobacco plants generated using the Z_p4CL1 sense 

transformation vector was screened by PCR and subjected to Southern and 
northern hybridization analyses. 



A PCR screening was undertaken using npt2 and Lp4CL1 specific 
20 primers for the initial identification of transgenic tobacco plants. Independent 
transgenic tobacco plants were identified to be co-transformed with both the 
selectable marker npt2 and the Lp4CL1 chimeric genes (Figure 32). 

Southern hybridisation analysis was performed with DNA samples from 
PCR positive transgenic tobacco plants to demonstrate the integration of the 
25 chimeric Lp4CL1 transgene in the tobacco plant genome. Independent 
transgenic tobacco plants carried between 1 and 5 copies of the Z_p4CL1 
transgene. No cross-hybridization was observed between the endogenous 
tobacco 4CL gene and the perennial ryegrass hybridization probe used (Figure 
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32). 



Northern hybridization analysis using total RNA samples prepared from 
the transgenic tobacco plants carrying the chimeric sense Lp4CL1 transgene 
and probed with the Lp4CL1 -specific hybridization probe revealed the 
5 presence of a 1.2 kb Z_p4CL1 transcript strongly expressed in one Lp4CL1- 
transgenic tobacco plant analysed (Figure 32). 

The sense and antisense transformation vectors of LpCCFM under the 
control of the CaMV 35S promoter were introduced into tobacco protoplasts 
via direct gene transfer. A set of transgenic tobacco plants was generated and 
10 screened by PCR with specific primers to identify transgenic tobacco plants 
carrying chimeric LpCCRI transgene. The molecular analysis of LpCCRI- 
transgenic tobacco plants is shown (Figure 33). 



EXAMPLE 7 



Production and characterisation of transgenic perennial ryegrass 
15 plants expressing chimeric OMT, 4CL, CCR and CAD genes from 

perennial ryegrass 



An improved transformation method was developed for the production 
of transgenic perennial ryegrass plants by biolistic transformation of 
embryogenic cells. Transgenic perennial ryegrass plants were generated using 
20 chimeric OMT, 4CL, CCR and CAD genes from perennial ryegrass and the 
improved transformation method. 

Improved method for the production of transgenic perennial ryegrass 
plants 

This improved procedure utilises embryogenic calli produced from 
25 mature seed-derived embryos as direct targets for biolistic transformation 
without requiring the establishment of embryogenic cell suspensions. The 
protocol relies on a continuous supply of isolated zygotic embryos for callus 
induction. Transgenic ryegrass plants can be regenerated 24 - 28 weeks after 
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embryo isolation (Fig. 34). Isolated embryos are plated onto MSM5 medium to 
produce embryogenic calli suitable as targets for biolistic transformation within 
8 weeks. The embryogenic calli, treated on high-osmoticum medium MSM3 
Plus prior to microprojectile bombardment, are selected on MSM3 medium 
5 containing 100 mg/l paromomycin (Pm) for 2 weeks before being transferred 
onto MSK with 100 mg/l Pm for further 4 weeks until differentiation of Pm 
resistant shoot appear. Regenerated shoots are transferred on to fresh 
selective media MSK with 100 mg/l Pm for a further 4 weeks (Figure 34). 

Production of transgenic perennial ryegrass plants expressing chimeric 
10 OMT, 4CL, CCR and CAD genes from perennial ryegrass 

Transgenic perennial ryegrass (Lolium perenne) plants were generated 
using chimeric ryegrass OMT, 4CL, CCR and CAD genes by biolistic 
transformation of embryogenic calli. Examples of the production and detailed 
molecular analysis of these transgenic ryegrass plants are described. 

15 Transgenic perennial ryegrass plants for OMT down-regulation were 

produced using biolistic transformation of embryogenic calli and plant 
transformation vectors pUbiomtl and pUbitmol carrying LpOmtl cDNA 
sequence in sense and antisense orientation under control of the constitutive 
maize ubiquitin promoter. These transgenic perennial ryegrass plants for 

20 down-regulated OMT activity were regenerated from paromomycin resistant 
calli obtained from biolistic transformation using microprojectilies coated with 
two plasmids; pHP23 (carrying the chimeric npt2 gene as the selectable 
marker) and either the sense or antisense LpOmtl transformation vector 
driven by the maize Ubi promoter. 

25 Transgenic perennial ryegrass plants were subjected to a polymerase 

chain reaction (PCR) screening using npf2-specific primers. Independent npt2 
PCR-positive transgenic perennial ryegrass plants obtained from biolistic 
transformation of embryogenic calli - generated from approximately 60,000 
isolated mature seed-derived embryos - using LpOmtl sense (pUbiomtl) and 

30 LpOmtl antisense (pUbitmol) transformation vectors were identified [16 
pUbiomtl transgenic plants and 27 pUbitmol transgenic plants] (Figure 35). 
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Southern hybridization analysis was performed with undigested and 
H/ndlll-digested DNA samples prepared from the PCR positive transgenic 
perennial ryegrass plants, to demonstrate their transgenic nature and the 
integration of the chimeric npt2 and LpOmtl transgenes. Independent 
5 transgenic perennial ryegrass plants co-transformed with both, the selectable 
marker npt2 gene and LpOmtl chimeric genes, were identified (Figure 35). In 
most instances, the transgenic perennial ryegrass plants recovered contained 
multiple copies of the selectable marker gene including rearranged transgene 
copies. No npf2-hybridizing bands were detected in the untransformed 
10 negative control. 

Samples of H/rtdlll-digested genomic DNA were included in the analysis 
when the LpOmtl gene-specific hybridization probe (omtl) was used. The 
omt 1 probe hybridized to a number of bands in DNA samples corresponding to 
both, the transgenic plants and the untransformed negative control. The omtl- 

15 hybridizing bands shared in all samples correspond to endogenous LpOmtl 
gene sequences represented as a small multigene family in the perennial 
ryegrass genome (Heath et al. 1998). The different omt -/-hybridizing bands 
evident in the samples from the transgenic plants and absent in the 
untransformed negative control sample correspond to antisense (tmol) and 

20 sense (omtl) LpOmtl transgene integration events (Figure 35). 

Northern hybridization analysis using strand-specific LpOmtl probes 
allowed the identification of transgenic perennial ryegrass plants expressing 
the antisense LpOmtl transgene (Figure 35). 

The OMT activity of selected antisense and sense LpOmtl transgenic 
25 perennial ryegrass plants was determined. Biochemical assays for OMT 
activity were initially established in untransformed plants (such as tobacco and 
perennial ryegrass). The assays utilise radiolabeled S-adenosylmethionine as 
the methyl donor for the OMT-catalysed conversion of caffeic acid into ferulic 
acid. The production of radioactive ferulic acid is measured and allows the 
30 OMT activity to be determined. 
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The OMT activity of selected LpOmtl -transgenic perennial ryegrass 
plants (L. perenne cv. Ellett) was determined. Significantly altered OMT activity 
in individual transformation events was observed (Figure 36). The 
manipulation of OMT activity in transgenic perennial ryegrass plants due to the 
5 expression of the chimeric ryegrass LpOmtl gene was thus demonstrated. 

Transgenic perennial ryegrass plants were recovered, using biolistic 
transformation of embryogenic calli, for the manipulation of the expression of 
genes encoding the key lignin biosynthetic enzyme, 4CL The plant 
transformation vectors pUbi4CL2 and pUbi2LC4 carrying chimeric Lp4CL2 

10 cDNA sequences in sense and antisense orientation, respectively, driven by 
the constitutive maize ubiquitin (Ubi) promoter were used. Perennial ryegrass 
plants for 4CL manipulation were regenerated from Pm-resistant calli obtained 
from biolistic transformation of embryogenic calli using microprojectiles coated 
with the plasmids pHP23, carrying a chimeric npt2 gene as selectable marker 

15 gene and the antisense pUbi2LC4. 

Transgenic perennial ryegrass plants were subjected to a polymerase 
chain reaction (PCR) screening using npt2-specific primers. Independent npt2 
PCR-positive transgenic perennial ryegrass plants were obtained from biolistic 
transformation of embryogenic calli (Figure 37). 

20 Transgenic perennial ryegrass plants were also recovered, using 

biolistic transformation of embryogenic calli, for the manipulation of the 
expression of genes encoding the key lignin biosynthetic enzymes, CCR and 
CAD. 

EXAMPLE 8 

25 Genetic mapping of perennial ryegrass OMT, 4CL, CCR and CAD 

genes 

Lp4CL1, Lp4CL3, LpCADI, LpCAD2, LpCCFM, LpOMTl and LpOMT2 
clones were PCR amplified and radio-labelled for use as probes to detect 
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restriction fragment length polymorphisms (RFLPs). RFLPs were mapped 
using 110 progeny individuals of the p1 50/1 12 perennial ryegrass reference 
population restricted with the enzymes described in the table below. 



Clones 


Polymorphic 


Enzyme 


Locus 


Linkage 




in p1 50/1 12 


mapped 




group 






with 






Lp4CL1 


Y 


Dra\ 


Lp4CL1 


2 


Lp4CL3 


Y 


EcoRV 


Lp4CL3 


6 


LpCADI 


Y 


EcoRV 


LpCADI 


2 


LpCAD 1.2.1 


Y 


EcoRI 


LpCAD2a 


7 








LpCAD2b 










LpCAD2c 


2 


LpCCRI 


Y 


EcoRI 


LpCCRI 


7 


LpOMTI 


Y 


Dra\ 


LpOMTI 


7 


LpOMT2 


Y 


EcoRV 


LpOMT2 


6 



5 



Lp4CL1, Lp4CL3, LpCADI , LpCAD2, LpCCRI, LpOMTI and LpOMT2 
loci mapped to the linkage groups as indicated in the table and in Figure 40. 
These gene locations can now be used as candidate genes for quantitative 
trait loci for lignin biosynthesis associated traits such as herbage quality, dry 
10 matter digestibility, mechanical stress tolerance, disease resistance, insect 
pest resistance, plant stature and leaf and stem colour. 

REFERENCES 

Heath et al (1988) cDNA cloning and differential expression of three 
caffeic acid O-methyltransferase homologues from perennial ryegrass (Lolium 
15 perenne). Journal of Plant Physiology 153:649-657 

Lichtenstein, C, And J. Draper (1985) Genetic engineering of plants. In: 
D.M. Glover (ed.), DNA Cloning, Vol. 2, pp. 67-119, IRL Press, Washington. 

Finally, it is to be understood that various alterations, modifications 
and/or additions may be made without departing from the spirit of the present 
20 invention as outlined herein. 
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It will also be understood that the term "comprises" (or its grammatical 
variants) as used in this specification is equivalent to the term "includes" and 
should not be taken as excluding the presence of other elements or features. 

Documents cited in this specification are for reference purposes only 
5 and their inclusion is not an acknowledgement that they form part of the 
common general knowledge in the relevant art. 



WO 01/95702 



PCT/AU01/00699 



-49- 

CLAIMS 

1. A substantially purified or isolated nucleic acid or nucleic acid 
fragment encoding an enzyme selected from the group consisting of 4 
coumarate CoA-ligase (4CL), cinnamoyl-CoA reductase (CCR) and cinnamyl 

5 alcohol dehydrogenase (CAD), from a ryegrass (Lolium) or fescue (Festuca) 
species. 

2. A nucleic acid or nucleic acid fragment according to claim 1 
wherein said ryegrass or fescue species is perennial ryegrass (Lolium 
perenne). 

10 3. A nucleic acid or nucleic acid fragment according to claim 1 

encoding 4CL and including a nucleotide sequence selected from the group 
consisting of (a) sequences shown in Figures 2, 3 and 4 hereto (Sequence ID 
Nos: 1 , 3 and 5; respectively) (b) complements of the sequences shown in 
Figures 2, 3 and 4 hereto (Sequence ID Nos: 1, 3 and 5, respectively); (c) 

15 sequences antisense to the sequences recited in (a) and (b); and (d) 
functionally active fragments and variants of the sequences recited in (a), (b) 
and (c). 

4. A nucleic acid or nucleic acid fragment according to claim 1 
encoding CCR and including a nucleotide sequence selected from the group 
20 consisting of (a) the sequence shown in Figure 10 hereto (Sequence ID No: 7); 
(b) the complement of the sequence shown in Figure 10 hereto (Sequence ID 
No: 7); (c) sequences antisense to the sequences recited in (a) and (b); and 
(d) functionally active fragments and variants of the sequences recited in (a), 
(b) and (c). 

25 5. A nucleic acid or nucleic acid fragment according to claim 1 

encoding CAD and including a nucleotide sequence selected from the group 
consisting of (a) the sequences shown in Figures 13, 14, 26 and 27 hereto 
(Sequence ID Nos: 9, 11, 14 and 16, respectively); (b) complements of the 
sequences shown in Figures 13, 14, 26 and 27 hereto (Sequence ID Nos: 9, 
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1 1 , 14 and 16, respectively); (c) sequences antisense to the sequences recited 
in (a) and (b); and (d) functionally active fragments and variants of the 
sequences recited in (a), (b) and (c). 

6. A vector including a nucleic acid or nucleic acid fragment 
5 according to claim 1 . 

7. A vector according to claim 6 further including a promoter and a 
terminator, said promoter, nucleic acid or nucleic acid fragment and terminator 
being operatively linked. 

8. A plant cell, plant, plant seed or other plant part, including a 
10 vector according to claim 6. 

9. A method of modifying lignin biosynthesis in a plant, said method 
including introducing into said plant an effective amount of a nucleic acid or 
nucleic acid fragment according to claim 1 and/or a vector according to claim 
6. 

15 10. Use of a nucleic acid or nucleic acid fragment according to claim 

1, and/or nucleotide sequence information thereof, and/or single nucleotide 
polymorphisms thereof as a molecular genetic marker. 

11. A substantially purified or isolated polypeptide from a ryegrass 
(Lolium) or fescue (Festuca) species, selected from the group consisting of the 

20 enzymes 4CL, CCR and CAD. 

12. A polypeptide according to claim 11 wherein said ryegrass or 
fescue species is perennial ryegrass (Lolium perenne). 

13. A polypeptide according to claim 11 wherein said polypeptide is 
4CL and includes an amino acid sequence selected from the group consisting 

25 of sequences shown in Figures 2, 3 and 4 hereto (Sequence ID Nos: 2, 4 and 
6, respectively); and functionally active fragments and variants thereof. 
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14. A polypeptide according to claim 11 wherein said polypeptide is 
CCR and includes an amino acid sequence selected from the group consisting 
of the sequence shown in Figure 10 hereto (Sequence ID No: 8); and 
functionally active fragments and variants thereof. 

5 15. A polypeptide according to claim 11 wherein said polypeptide is 

CAD and includes an amino acid sequence selected from the group consisting 
of the sequence shown in Figures 13, 14, 26 and 27 hereto (Sequence ID 
Nos: 10, 12, 15 and 17, respectively); and functionally active fragments and 
variants thereof. 

10 16. A lignin or modified lignin substantially or partially purified or 

isolated from a plant, plant seed or other plant part according to claim 8. 

1 7. An isolated regulatory element capable of causing expression of 
an exogenous gene in plant cells, wherein said regulatory element is isolated 
from a nucleic acid or nucleic acid fragment encoding a protein selected from 

15 the group consisting of: O-methyl transferase (OMT), 4 coumarate CoA-ligase 
(4CL), cinnamoyl-CoA reductase (CCR) and cinnamyl alcohol dehydrogenase 
(CAD). 

18. A regulatory element according to claim 17 wherein said 
regulatory element includes an O-methyltransferase promoter. 

20 19. A regulatory element according to claim 17 wherein said 

regulatory element includes a 4 coumarate CoA-ligase promoter. 

20. A ' regulatory element according to claim 17 wherein said 
regulatory element includes a cinnamoyl CoA-reductase promoter. 

19. A regulatory element according to claim 17 from a ryegrass 
25 (Lolium) or Fescue (Festuca) species. 

20. A regulatory element according to claim 17 including the first 
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approximately 4630 nucleotides of the sequence shown in Figure 18 hereto 
(Sequence ID No: 13); or a functionally active fragment or variant thereof. 

21. A regulatory element according to claim 17 including the first 
approximately 2206 nucleotides of the sequence shown in Figure 38 hereto 

5 (Sequence ID No: 1 7); or a functionally active fragment or variant thereof. 

22. A regulatory element according to claim 17 including the first 
approximately 6735 nucleotides of the sequence shown in Figure 39 hereto 
(Sequence ID No: 18); or a functionally active fragment or variant thereof. 

23. A vector including a regulatory element according to claim 1 7. 

10 24. A vector according to claim 23 further including an exogenous 

gene and a terminator, said regulatory element, exogenous gene and 
terminator being operatively linked, such that said regulatory element is 
capable of causing expression of said exogenous gene in plant cells. 

25. A plant cell, plant, plant seed or other plant part, including a 
15 vector according to claim 23. 

26. A method for expressing an exogenous gene in plant cells, said 
method including introducing into said plant cells an effective amount of a 
regulatory element according to claim 17 and/or a vector according to claim 
23. 



20 
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CGG C AC GAGTGGACTTTC CG AC GC CG GAG TCG C C GAT GATGAC C G CCTTGAGG AGG TAGT 
1 + + + + + + 60 



CGTAGTCGTCCTCCGCCCTGTACGCGCCGCTGCCCGCCATTTCCTTCCTCGCCTCGCGGG 
61 + + + + + + 120 



TCCTCCTCCCCGACCTGCGCTAGGCTCTGGATCTCGCGGGGTTTGGGCGCGGCGTCCTCG 
121 + + + + + + 180 



CTGTGAGCTCGTGCCGAATTCGGCACGAGCCACCTTCGAGGCGTGCACTGGTACGAGCTC 
181 + + + + + + 240 



GCGAGCCATTGTCAGTGCAGTGTAGGCTCTGCTACTCGTTGGCCATTCCAAGAAGCTCTC 
241 + + + + + + 300 



TG CTC CCTG AAAC CAGAG GATCATGATCACG GTGGCGGCGCCC GAG GTGCAGCAG CCGCA 

301 + + + + + + 360 

MITVAAPEVQQPQ 



GATCGCGGCGGCTGCTGCGGCCGTGGAGGCGGCGGCACCGGAGGCGACGACGATCTTCCG 

3 61 + + + + + + 420 

IAAAAAAVEAAAPEATTIFR 



GTCCAGGCTCCCGGACATCGACATCCCGACCCACATGCCCCTGCACGACTATTGCTTCGC 

421 + + + + + + 480 

SRLPDIDIPTHMPLHDYCFA 



GACGGCAGCCTCGGCCCCGGACGCGCCGTGCCTCATCACCGCGGCCACGGGGAAGACCTA 

481 + + + + + + 540 

TAASAPDAPCL I TAATGKTY 



CACGTTCGCCGAGACGCACCTGCTGTGCCGCAAGGCCGCGGCGGCGCTGCACGGGCTCGG 

541 + + + + + + 600 

TFAETHLLCRKAAAALHGLG 



CGTGCGCCACGGGGACCGGATCATGCTGCTGCTCCAGAACTCCGTGGAGTTCGCGCTCGC 

601 + + + + + + 660 

VRHGDRIMLLLQNSVEFALA 

CTTCTTCGGCGCGTCCATGCTCGGCGCCGTCAGCACGGCGGCGAACCCGTTCTGCACGCC 

661 + + + + + + 720 

F FGASMLGAVS TAANPFCTP 



GCAGGAGATCCACAAGCAGCTCGTGGCCTCCGGCGCGAAGCTGGTCGTCACGCAGTCCGC 

721 + + + + + + 780 

QEIHKQLVASGAKLVVTQSA 
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CTACGTCGACAAGCTCCGGCACGAGGCCTTCCCCCGAATCGGCGAGGCCCTCACCGTGAT 

7 81 + + + + + + 840 

YVDKLRHEAF PR IGEALTVI 

CACCATCGACGAGGACGACGGCACCCCGGACGGCTGCCAGCCGTTCTGGGCCCTCGTGTC 

841 + + + + + + 900 

TIDEDDGTPDGCQPFWALVS 

AGCCGCCGACGAGAACAGCGTCCCGGAGTCTCCCATCTCGCCGGACGACGCGGTGGCGCT 

901 + + + + + + 960 

AADENSVPESPI SPDDAVAL 

GCCCTACTCGTCGGGCACGACGGGGCTGCCCAAGGGCGTGGTGCTGACGCACGGGGGGCT 

961 + + + + + + 1020 

PYSSGTTGLPKGVVLTHGGL 

GGTGTCGAGCGTGGCGCAGCAGGTGGACGGCGAGAACCCGAACCTGCACATGCGGGCGGG 

1021 + + + + + + 1080 

VSSVAQQVDGENPNLHMRAG 

GGAGGACGTGGTGCTCTGCGTGCTGCCGCTCTTCCACATCTTCTCGCTCAACTCGGTGCT 

1081 + + + + + + 1140 

EDVVLCVLPLFHIF SLNSVL 

GCTGTGCGCGCTGCGGGCGGGCGCCGCCGTGATGCTGATGCCTAGGTTCGAGATGGGGGC 

1141 + + + -+ r— + + 1200 

LCALRAGAAVMLMPRFEMGA 

CATGCTGGAGGGCATCGAGCGGTGGCGCGTCACGGTGGCGGCCGTGGTGCCGCCGCTGGT 

1201 + + + + + + 1260 

MLEGIERWRVTVAAVVPPLV 

GCTCGCGCTCGCCAAGAACCCCGGGGTGGAGAAGCACGACCTCAGCTCCATTCGGATCGT 

1261 + + + + + + 1320 

LALAKNPGVEKHDLSSIRIV 

GCTCTCCGGCGCCGCGCCGCTCGGCAAGGAGCTCGAGGACGCGCTACGTGGCCGCCTGCC 

1321 + + + + + + 1380 

LSGAAPLGKELEDALRGRLP 

GCAGGCCATCTTCGGACAGGGCTACGGGATGACGGAGGCCGGGCCGGTGCTGTCCATGTG 

1381 + + + + + + 1440 

QAIFGQGYGMTEAGPVLSMC 

CCCGGCGTTCGCGCGGGAGCCGACGCCGGCCAAGTCCGGCTCGTGCGGCACCGTGGTGCG 

1441 + + + + + + 1500 

PAFAREPTPAKS GS CGTVVR 
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CAACGCCCAGCTCAAGGTGGTCGACCCCGACACCGGCGTCTCCCTCGGCCGCAACCTCCC 

1501 + + + + + + 1560 

NAQLKVVDPDTGVSLGRNLP 

CGGCGAGATCTGCATCCGCGGCCCGCAGATCATGAAAGGATACTTGAATGATCCCGTGGC 

1561 + + + + + + 1620 

GEICIRGPQIMKGYLNDPVA 

CACGGCCGCGACCATCGACGTCGAGGGGTGGCTCCACACCGGCGACATCGGCTACGTCGA 

1621 + + + -+ + + 1680 

TAATIDVEGWLHTGDIGYVD 

CGACGACGACGAGGTCTTCATCGTCGACCGCGTCAAGGAGCTCATCAAGTTCAAGGGCTT 

1681 + + + + + + 1740 

DDDEVF IVDRVKELIKFKGF 

CCAGGTACCGCCGGCCGAGCTCGAGGCTCTGCT CATC GCG CATC CGTCCATCGCCGACGC 

1741 + + + + + + 1800 

QVPPAELEALL IAHPSIADA 

GGCCGTCGTCCCGCAAAAGGATGATGCCGCCGGCGAGGTCCCGGTTGCCTTCGTGGTCCG 

1801 + + + + + + 1860 

AVVPQ KDDAAGEVPV AFVVR 

CGCCGCCGACTCCGACATCGCCGAGGAGGCCATCAAGGAGTTCGTATCCAAGCAGGTGGT 

1861 + + + + + + 1920 

AADSDIAEEAIKEFVSKQVV 

GTTCTACAAGAGGCTGCACAAGGTCTACTTCACCCACGCGATACCCAAGTCGGCGTCGGG 

1921 + + + + + + 1980 

FYKRLHKVYFTHAIPKSASG 

GAAGATACTCAGGAAAGAACTCAGAGCTAAACTCGCCGCCCCGGCCACTGCCTGAAGAGT 

1981 + + + + + + 2040 

KILRKELRAKLAAPATA*RV 

GGTTCATG GCTTCATGCTAATCATTTCGATCAGAAAG GCACTTCTAGCATATATGTTCCA 

2041 + + + + + + 2100 

VHGFML I IS IRKALLAYMFH 

CCTTTTGTTTCATTTGGAAGATTGTATTCCAGCTAGTGGCCAGTGACTGAGTAAGGGATG 

2101 + + + + + + 2160 

LLFHLEDCIPASGQ* 

GGGATAAAAGTTTTGTCTACGTTTTCTTTTACGCTACTCTCTCCATTGGGGAGTACAATG 
2161 + + + : — + + + 2220 

TATCAGGGGATTCGTGATTGAAGTTAATCAAGATTGGTTCAATTATAAAAAAAAAAAAAA 
2221 + + + + + + 2280 

AAAA 
2281 2284 
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CGGCACGAGCGCCATTCCTCCACCTTCAGCTCCGGCCAAAGATTTCCATCCGGCGAGATC 
1 + + + + + + 60 

CATGG G CTC CATC GCGGCGGACGCGCCTCCCGCG GAG CTGGTGTTCCGGTCCAAGCTCCC 

61 + + + + + + 120 

MGS IAADAPPAELVFRSKLP 

GGACATCGAGATCCCGACCCACCTGACGCTGCAGGACTACTGCTTCCAGCGCCTGCCGGA 

121 + + + + + + 180 

DIE I PTHLTLQDYCFQR LPE 

GCTCTCCGCGCGCGCCTGCCTCATCGACGGCGCCACGGGCGCCGCGCTCACCTACGGCGA 

181 + + + + + + 240 

LSARACL IDGATGAALTYGE 

GGTGGACGCCCTGTCCCGCCGCTGCGCCGCGGGGCTGCGCCGCCTCGGCGTCGGCAAGGG 

241 + + + + + + 300 

VDALSRRCAAGLRRLGVGKG 

CGACGTCGTCATGGCGCTCCTCCGCAACTGCCCCGAGTTCGCCTTCGTGTTCCTCGGCGC 

301 + + + + + + 360 

DVVMALLRNCPEFAFVFLGA 

GGCCCGGCTCGGCGCCGCCACCACCACCGCCAACCCGTTCTACACGCCCCACGAGATCCA 

361 + + + + + + 420 

ARLGAATTTANPFYTPHEIH 

CCGCCAGGCCACCGCCGCCGGGGCCAGGGTCATCGTCACCGAGGCCTGCGCCGTCGAGAA 

421 + + + + + + 480 

RQATAAGARVIVTEACAVEK 

GGTGCGCGCCTTCGCCGCCGAGAGAGGGATTCCCGTCGTCTCCGTCGACGAGGGCGTCGA 

481 + + + + + + 540 

VRAFAAERG I PVVSVDEGVD 

CGGCGGCTGCCTCCCGTTCGCCGAGACTCTGCTCGGGGAAGAAAGCGGGGAGCGGTTCGT 

541 + + + + + + 600 

GGCLPFAETLLGEESGERFV 

CGACGAGGCGGTCGACCCCGACGACGTGGTGGCGCTGCCGTACTCGTCCGGCACCACCGG 

601 + + + + + + 660 

DEAVD PDDVVALPYS SGTTG 

CCTGCCCAAGGGCGTCATGCTCACCCACCGCAGCCTCGTCACCAGCGTCGCCCAGCAGGT 

661 + + + + + + 720 

LPKGVMLTHRSLVTSVAQQV 

GGACGGTGAGAACCCGAACCTGCACTTCAGCTCGTCGGACGTGCTGCTGTGCGTGCTGCC 

721 + + + + + + 780 

DGENPNLHFSSSDVLLCVLP 
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GCTGTTCCACATCTACTCGCTCAACTCGGTGCTGCTCGCCGGTCTCCGCGCCGGGTGCGC 

781 -+ + + + + + 840 

LFHIYSLNSVLLAGLRAGCA 

GATCGTGATCATGCGCAAGTTCGACCACGGCGCGCTGGTGGACCTGGTGCGCACGCACGG 

841 + + + + + + 900 

IVIMRKFDHGALVDLVRTHG 

CGTCACCGTGGCGCCATTCGTGCCGCCCATCGTGGTGGAGATCGCCAAGAGCGCGCGGGT 

901 + + + + + + 960 

VTVAPFVPPIVVEIAKSARV 

GACCGCCGCGGACCTGGCGTCCATCCGGCTGGTCATGTCGGGGGCGGCGCCCATGGGCAA 
961 + + + + + + 1020 

TAADLAS IRLVMS GAAPMGK 

GGAGCTGCAGGACGCGTTCATGGCCAAGATCCCCAACGCCGTGCTCGGCCAGGGATATGG 

1021 + + + + + + 1080 

ELQDAFMAK IPNAVLGQGYG 

GATGACCGAGGCCGGCCCTGTGCTGGCGATGTGCCTGGCCTTCGCCAAGGAGCCGTTCGC 

1081 + + + + + + 1140 

MTEAGPVLAMCLAFAKEPFA 

GGTCAAGTCCGGTTCCTGCGGCACCGTCGTCAGGAACGCCGAGCTCAAGATCGTCGACCC 

1141 + + + + + + 1200 

VKSGSCGTVVRNAELKIVDP 

CGACACCGGCGCCTCCCTCGGCCGCAACCTGCCGGGGGAGATCTGCATCCGCGGCAAGCA 

DTGASLGRNLPGEICIRGKQ 

GATCATGAAAGGTTACCTAAATGATCCGGTGGCCACAAAGAACACCATTGACAAGGACGG 

1261 + + + + + + 1320 

IMKGYLNDPVATKNTIDKDG 

TTGGCTGCATACTGGTGACATTGGTTATGTCGATGATGACGACGAGATCTTTATTGTCGA 

1321 + + + + + + 1380 

WLHTGDIGYVDDDDEIFIVD 

CAGACTGAAGGAGATAATTAAATATAAGGGATTCCAAGTACCTCCGGCGGAACTTGAAGC 

1381 + + + + +- + 1440 

RLKEI IKYKGFQVPPAELEA 

CCTTCTCATTACACACCCTGAAATCAAGGATGCTGCTGTCGTATCGATGCAAGACGAACT 

1441 + + + + + + 1500 

LLITHPEIKDAA VVSMQDEL 

TGCTGGTGAAGTTCCGGTTGCGTTTGTTGTGCGGACTGAGGGTTCAGAGATCAGCGAAAA 

1501 + + + + + + 1560 

AGEVPVAFVVRTEGSEISEN 
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CGAGATCAAGCAGTTCGTTGCAAAAGAGGTTGTTTTCTACAAGAGGATCTGCAAAGTGTT 

1561 — + + + + + + 1620 

EIKQFVAKEVVFYKRICKVF 

CTTCGCGGATTCCATTCCAAAGAGTCCATCTGGCAAGATCCTCAGGAAGGACCTGAGAGC 

1621 + + + + + + 1680 

FADS IPKSPSGKILiRKDLRA 



AAAG C TCG C CG C AG G CATTC C CAG CAGTAATAC CACACAGTCC AAAAG CTAAGTC AG ATA 

1681 + + + + + + 1740 

KLAAGIPSSNTTQSKS* 



TATTGTTTCCCAACCTTACACACCTCTGTCCAACACCATGTAATGTTCTTAATATAAACG 
1741 + + + + + + 1800 



GAAATTAT TAC ATATAGAAG GG CTGATTCTTTTTACTAG ATGTGT C CAAC ATATG ATATG 
1801 + + + + + + 1860 

CTTGTTAGGCCGATGATGTGTAACCTGTCATGTATAGATACCGCCTTTTTTTGACAAGAA 
1861 + + + + + + 1920 



AG GCT G ATT AT AATG TATAC CGTG AACTGAATATTTGTTCAGGG AGATCAAAAAAAAAAA 
1921 + + + + + + 1980 



AAAAAAAAAAAA 
1981 + — 1992 
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CGGC ACGAGATCTC C C ACGACTAATTTAGAAGAAGATTTAC TTAGTCTCTGCTTCTCGCT 
1 + + + + + + 60 

CGAT CGC CGGCCGGTGAGGTAG CTAGCTAGC TACTCGTAC TAGAC CATTACC ATGGGTTC 

61 + + + + + + 120 

M G S 

CGTGCCGGAGGAGTCAGTGGTGGCGGTGGCACCGGCGGAGACGGTGTTCCGGTCGAAGCT 

121 + + + + + + 180 

VPEESVVAVAPAETVFRSKL 

CC CCGACATCGAG ATC AACAACGAGCAGACGC TGCAGAGC TACTGCTTCGAG AAGATGGC 

181 ; + + + + + + 240 

PDIEINNEQTLQSYCFEKMA 

CGAGGTCGCGTCCCGCCCCTGCATCATCGACGGCCAGACGGGCGCCTCCTACACCTACAC 

241 + + + + + + 300 

EVASRPCI IDGQTGASYTYT 

GGAGGTCGACTCCCTGACCCGTCGCGCCGCGGCGGGGCTGCGCCGCATGGGCGTGGGGAA 

301 + + + + + + 360 

EVDSLTRRAAAGLRRMGVGK 

GGGCGACGTGGTGATGAACCTGCTGCGCAACTGCCCGGAGTTCGCCTTCTCCTTCCTGGG 

3 61 + + + + + + 420 

GDVVMNLLRNCPEFAFSFLG 

CGCGGCGCGGCTGGGCGCCGCCACCACCACCGCCAACCCGTTCTACACCCCGCACGAGAT 

421 " + + + + + + 480 

AARLGAATTTANPFYTPHEI 

CCACCGCCAGGCGGAGGCGGCGGGCGCCAAGCTGATCGTGACCGAGGCCTGCGCCGTGGA 

481 + + + + + + 540 

HRQAEAA.GAKL IVTEACAVE 

GAAGGTGCTGGAGTTCGCGGCGGGGCGGGGCGTGCCCGTGGTCACCGTCGACGGGAGGCG 

541 + + + + + + 600 

KVLEFAAGRGVPVVTVDGRR 

CGACGGGTGCGTGGACTTCGCGGAGCTGATCGCCGGCGAGGAGCTGCCCGAGGCGGACGA 

601 + + + + + + 660 

DGCVDFAELIAGEELPEADE 

GGCCGGGGTCCTCCCCGACGACGTCGTCGCCCTGCCCTACTCCTCCGGCACCACCGGGCT 

661 + + + — + + + 720 

AGVLPDDVVALPYSSGTTGL 

CCCCAAGGGCGTCATGCTCACCCACCGCAGCCTCGTCACCAGCGTCGCCCAGCTGGTCGA 

721 + + + + + + 780 

PKGVMLTHRSLVTSVAQLVD 
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CGGGTCGAACCCTAACGTGTGCTTCAACAAGGACGACGCGCTGCTGTGCCTGCTGCCGCT 

781 + + + + + + 840 

GSNPNVCFNKDDALLCLLPL 

CTTCCACATCTACTCGCTGCACACGGTGCTGCTGGCGGGGCTCCGCGTCGGCGCCGCCAT 

841 + + + + + + 900 

FHIYSLHTVLLAGLRVGAAI 

CGTCATCATGCGCAAGTTCGACGTCGGCGCGCTGGTGGACCTCGTCCGCGCGCACCGCAT 

901 + + +- + +- + 960 

VIMRKFDVGALVDLVRAHRI 

CACCATCGCGCCATTCGTGCCGCCCATCGTCGTGGAGATCGCCAAGAGCGACCGCGTCGG 

961 + + + + + + 1020 

TIAPFVPPIVVEIAKSDRVG 

CGCCGACGACCTCGCATCCATCCGCATGGTGCTCTCCGGCGCCGCGCCCATGGGCAAGGA 

1021 + + + + + + 1080 

ADDLAS IRMVL SGAAPMGKD 

CCTCCAGGACGCCTTCATGGCCAAGATCCCCAACGCCGTGCTCGGACAGGGGTACGGGAT 

1081 + + + + + + 1140 

LQDAFMAKIPNAVLGQGYGM 

GACCGAGGCTGGGCCGGTGCTGGCCATGTGCCTGGCGTTCGCCAAGGAGCCGTTCAAGGT 

1141 + + + + + + 1200 

TEAGPVLAMCLAFAKEPFKV 

CAAGTCCGGGTCGTGCGGAACCGTGGTGCGCAACGCCGAGCTCAAGGTCGTCGACCCCGA 

1201 + + + + + + 1260 

KSGSCGTVVRNAELKVVDPD 

CACCGGCGCATCCCTCGGCCGGAACCAGCCTGGCGAGATTTGCGTCCGGGGGAAGCAGAT 

1261 + + + + + + 1320 

TGASLGRNQPGEICVRGKQI' 

C ATG AT AGG TTAC CTG AAC G AC C C AG AGTC G AC C AAG AAC AC CAT C G AC AAGG AC GG C TG 

1321 + + + + + + 1380 

MIGYLNDPESTKNTIDKDGW 

GCTGCACACCGGAGACATCGGCTTGGTGGATGACGACGACGAGATCTTCATCGTCGACAG 

1381 + + + + + + 1440 

LHTGDIGLVDDDDE IFIVDR 

GCTCAAGGAGATCATCAAGTACAAGGGCTTCCAAGTGGCGCCGGCGGAGCTCGAGGCCCT 

1441 + + + + + + 1500 

LKEIIKYKGFQV-APAE LEAL 

CCTCCTCACGAACCCGGAGGTCAAGGACGCCGCCGTCGTAGGGGTGAAGGATGATCTCTG 

1501 + + + + + + 1560 

LLTNPEVKDAAVVGVKDDLC 
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CGGCGAAGTCCCGGTCGCCTTCATTAAGAGGATCGAAGGATCTGAGATCAACGAGAACGA 

1561 + + + + + + 1620 
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CACCGACTCCATTCCCAAGAACCCTTCCGGCAAGATCCTAAGGAAGGACTTGAGAGCCAG 
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GGCACGAGGAATCCTACCAAACCGAGCTACCAGATCCTTCTCTACTAATCGAGCTCCCTA 
1 + + + + + + 60 

CGCTGCTCCGCCTGTCTTCGTTTCCGCCTCACCGCCGGCCGGTTCTCCGCTCCAAGCTAC 
61 + + + + + + 120 

GTCCGTCCGTCCACATATATAGCATCGACATGACCATCGCCGAGGTCGTGGCTGCCGGAG 

121 + + + + + + 180 

MTIAEVVAAGD 

ACACCGCCGCCGCGGTGGTGCAGCCCGCCGGGAACGGGCAGACCGTGTGCGTGACCGGCG 

181 + + + + + + 240 

TAAAVVQ PAGNGQTVCVTGA 

CCGCCGGGTACATCGCGTCGTGGCTCGTCAAGCTGCTGCTGGAGAAGGGGTACACCGTCA 

241 + + + + + + 300 

AGYIASWLVKLLLEKGYTVK 

AGGGCACCGTCAGGAACCCAGACGACCCGAAGAACGCGCACCTGAGGGCGCTCGACGGCG 

301 + + + + + + 360 

GTVRNPDDPKNAHLRALDGA 

CCGCCGACCGGCTGGTCCTCTGCAAGGCCGACCTCCTCGACTACGACGCCATCCGCCGCG 

361 + + + + + + 420 

ADRLVLCKADLLDYDAIRRA 

CCATCGACGGCTGCCACGGCGTCTTCCACACCGCGTCCCCCGTCACCGACGACCCCGAGC 

421 + + + + + + ,480 

IDGCHGVFHTASPVTDDPEQ 

AAATGGTGGAGCCGGCGGTGAGGGGCACGCAGTACGTCATAGACGCGGCGGCGGAGGCCG 

481 + + + + + + 540 

MVE PAVRGTQYVI DAAAEAG 

GCACGGTGCGGCGGATGGTGCTCACCTCCTCCATCGGCGCCGTCACCATGGACCCCAACC 

541 + + + + + + 600 

TVRRMVLTSSIGAVTMDPNR 

GCGGGCCGGACGTGGTCGTCGACGAGTCGTGCTGGAGCGACCTCGACTTCTGCAAGAAAA 

601 + + + + + + 660 

GPDVVVDESCWSDLDFCKKT 

CCAGGAACTGGTACTGCTACGGGAAGGCGGTTGCGGAGCAGGCGGCATCGGAGTTGGCGC 

661 + + + + + + 720 

RNWYCYGKAVAEQAASELAR 

GGCAGCGCGGCGTGGACCTTGTGGTGGTGAACCCGGTGCTGGTGATCGGCCCCCTGCTGC 

721 + + + + + + 780 

QRGVDLVVVNPVLVIGPLLQ 
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AGCCGACGGTGAACGCCAGCATCGGCCACATCCTCAAGTACCTGGACGGGTCGGCCAGCA 

781 + + + + + + 840 

PTVNASIGHILKYLDGSASK 



AGTTCGCCAACGCCGTGCAGGCGTACGTGGACGTCCGCGACGTGGCCGACGCCCACCTCC 

841 + + + + + + 900 

FANAVQAYVDVRDVADAHLR 



GCGTCTTCGAGTGCGCCGCCGCGTCCGGCCGCCACCTCTGCGCCGAGCGCGTCCTCCACC 

901 + + + + + + 960 

VFECAAASGRHLCAERVLHR 



GCGAGGACGTCGTGCGCATCCTCGCCAAGCTCTTCCCCGAGTACCCCGTCCCCACCAGGT 

961 + + + + + + 1020 

EDVVRILAKLFPEYPVPTRC 



GCT CTG ATG AG AC GAACC CGAG G AAGCAG CCATACAAGATGTCGAACC AG AAGC TC CAGG 

1021 + + + + - + + 1080 

SDETNPRKQPYKMSNQKLQD 

ACCTCGGACTCGAGTTCAGGCCGGTGAGCCAGTCCCTGTACGAGACGGTGAAGAGCCTCC 
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EKGHLPVLS EQAEADKETLA 



CTG C C GAGC TG C AG G CAGGG GTTAC CATC CGAG CATGAGG AAC AAGAAATCAAC C ATGTC 

1201 + + + + + + 1260 

AELQAGVTIRA* 

CATACTGCTACTGTCATGTAAACCAGCTGTTGAATGCCTAAAATCTAAGTTCTTGTAATA 
1261 + + + + + + 1320 



CTGT GTTGTTTC ATGTG G ACT AG ATTGATC GAATAAAC ATCTC TACACAAG GTT G CTAAA 
1321 + + + + + + 1380 



AAAAAAAAAAAAAAA 
1381 + 1395 
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GGCACGAGCAACAAGTCATCAATGGCGGAAGGCTTGCCGGCGCTCGGTTGGGCTGCGAGG 

1 + + + + + + 60 

MAEGLPALGWA.AR 



GACGCCTCCGGTCACCTCTCCCCTTACAGCTTCTCGAGAAGCGTTCCGAAGGACGACgAT 

61 + + + + + + 120 

DASGHLSPYSFSRSVPKDDD 



GTGACGATCAAGGTGCTCTTCTGCGGGATCTGCCACACTGACCTCCACATCATCAAGAAC 

121 + + — + + + + 180 

VTIKVLFCG ICHTDLHIIKN 



GACTGGGGCAACGCCCTCTACCCCATCGTCCCAGGGCATGAGATCGTGGGCGTCGTCGCC 

181 + + + + + + 240 

DWGNALYP IVPGHEIVGVVA 



AGCGTCGGCAGCGGCGTCAGCAGCTTCAAGGCCGGCgACACGGTGGGCGTGGGCTACTTC 

241 + + + + + — + 300 

SVGSGVS SFKAGDTVGVGYF 



CTCGACTCCTGCCGCACCTGCTACAGCTGCAGCAAGGGGTACGAGAACTTCTGCCCCACC 
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LDSCRTCYSCSKGYENFCPT 



CTGACGCTCACCTCCAACGGCGTCGACGGCGGCGGCGCCACCACCCAGGGCGGCTTCTCC 
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LTLTSNGVDGGGATTQGGFS 



GACGTCCTCGTCGTCAACAAGGACTACGTCATCCGCGTCCCGGACAACCTGCCCCTGGCC 
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DVLVVNKDYVIRVPDNLPLA 



GGCGCGGCACCTCTCCTCTGCGCCGGCGTCACAGTCTACAGCCCTATGGTGGAGTACGGC 
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GAAPLLCAGVTVY S PMVEYG 



CTCAACGCCCCcgGGAAGCACyTCGGcGTCGTCGGCCTGGGCGGGCTCGGCCACGTCGcC 

541 4- + + + + + 600 

LNAPGKHXGVVGLGGLGHVA 



GTCAAGTTCGGCAAGGCCTTCGGGATGACCGTCACCGTCATCAGCTCCTCGGACAGGAAG 

601 + + + + + ' + 660 

VKFGKAFGMTVTVISSSDRK 



CGCGACGAGGCGCTCGGCCGCCTCGGCGCCGACGCcTTCCTCGTCAGCAGCGACCCCGAG 

661 + + + + + + 720 

RDEALGRLGADAFLVSSDPE 
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CAGATGAAGGCGGCGGCGGGCACCATGGACGGCATCATCGACACGGTGTCCGCGGGCCAC 

721 + + +- + + + 780 

QMKAAAG TMDG I I DTVSAGH 

CCGATCGTGCCGCTGCTCGACCTGCTCAAGCCCATGGGGCAGATGGTCGTGGTGGGCGCG 
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PIVPLLDLLKPMGQMVVVGA 

CCCAGCAAGCCGCTCGAGCTCCCGGCCTTCGCCATCATCGGCGGCGGCAAGCGCCTCGCC 
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GGGAGCGGCACCGGCAGCGTCGCACACTGCCagGCCATGCTCGACTTCGCGGGCAAGCAC 
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GGCATCACCGCCGACGTCGAGGTCGTCAAGATGGACTACgGTCAACACCGCCATCGAGCG 
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G ITADVEVVKMDYGQHRHRA 
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AREERRQVPLRHRRRRQPPA 
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GHRRLTCATQCGRALVWSRK 

AGGTTCGCCGGCTCACAGCCACATGAACAAGTCAATGAGTCGTTGGTGTGTTGTTTATCT 

1141 + + + + + + 1200 

RFAGSQPHEQVNESLVCCLS 

TCATTCCACATATGGGACGCAGTTCCAGATTTTCATGTCAAATAATTGCGTCGTGTGCGG 

1201 + + + + + + 1260 

SFHIWDAVPDFHVK 

TTGTCAAGACTCAAATAGGAGAAAAAAAGACTCGTGATTTCGTTTTGCAAAAAAAAAAAA 
1261 + + + + + + 1320 

AAAAA 
1321 1325 
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QTEHHQHTRKAVGLAARDDA 
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GHLSPLAITRRSTGDDDVVI 

TAAAGATTTTGTACTGCGGAATCTGCCACTCTGACCTGCACGCCCTGAAGAACGACTGGA 
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KILYCGICHSDLHALKNDWK 

AGAACTCAAGGTACCCGATGATCCCCGGGCACGAGATCGCCGGCGAGGTCACGGAGGTGG 
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NSRYPMIPGHE IAGEVTEVG 

GCAAGAACGTGAGCAAGTTCAAGGCCGGCGACCGCGTGGGCGTCGGGTGCATGGTGAACT 
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KNVSKFKAGDRVGVGCMVNS 

CGTGCCGGTCGTGCGAGAGCTGCGACAAGGGCTTCGAGAACCACTGCCCGGGCATGATCC 
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CRSCESCDKGFENHCPGMIL 

TCACCTACAACTCGGTCGACGTCGACGGCACCGTCACCTACGGCGGCTACTCCAGCATGG 
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TYNSVDVDGTVTYGGYSSMV 

TGGTGGTGCACGAGCGGTTCGTGGTCCGGTTCCCCGACGCCATGCCGCTGGACAAGGGCG 
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VVHERFVVRFP DAMPLDKGA 

CGCCGCTGCTGTGCGCCGGCATCACCGTGTACA.GCCCCATGAAGTACCACGGGCTCAACG 
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PLLCAG I TVYS PMKYHGLNV 

TTCCCGGGCTGCACCTCGGCGTGCTGGGGCTGGGCGGGCTGGGCCACGTTGCGGTCAAGT 
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PGLHLGVLGLGGLGHVAVKF 

TCGGCAAGGCCTTCGGAATGAAAGTGACGGTGATCAGCTCGTCGCCGGGGAAGAAGGAGG 
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GKAFGMKVTVI SSSPGKKEE 
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AGGCCCTGGGGCGGCTGGGCGCCGACGCGTTCATCGTCAGCAAGGACGCCGACGAGATGA 
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ALGRLGADAFIVSKDADEMK 

AGGCTGTGATAGCACCATGGATGGCATCANTAAACACGGTATCTGCAAACATCCCCCTGA 

841 + + + + + + 900 
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pBluescript 

GCGGC C GCTC T AAAACT AGTGG ATC C CCCGGGCTGC AGGAATTCGATATCAAGCTTATnG 
Sail 

ATACCGTCGACAGCGGTTnCAAATCGCCGGTCCTGGGGTGGAAGTGnAGCAGTGGGAAGA 

+ + + -+ + ---- +- -4581 

TGTGTGCGAGGGGTTGTGTTTTGGATGnAAGACAGGCGGGCCAGTGGAGAACAAGAGAGA 
-4580 + + _ — + — + + +- -4521 

ACGCGAGAGGCCAAAGTATCCGCAGCCCCGCAAACAAGGCCTAGATTTGGGTTAAGTTTG 
-4520 — + — — + - — + — - + + — -4461 

GGTCGTCTCAGACACCGCGGCCATCCTTTTAGGTGGTCCGCGCGCTGGACCGTATTTTTA 
-4460 +- + — — +- ____+ — + -4401 

TCTGAGTTGACCCATTCAGACGCGCAGACACGAGATGGATGGTGCAGTwAgAGATGACCT 
-4400 + +— +- + + + -4341 

Hindlll 

AAGTACAArAACCTCTCCCCGA . GCTGCCGCCATCcGTCACTTACCGAGCGAcAAAGcTT 
-4340 + - +- + — +- + -4281 

CCCACTTCATCACACTCAGCCCAGCAAGCATACTGATGGTGAGCGCACTCGCGGCTGTGC 
-4280 +-- +- +- + + + -4221 

CCACCGACCCCACGCCATCCAAAACCAACTCTACTTTTCACCmCACCAACAAAAGACAAA 
-4220 + — — + — + — + + — — -+ — — -4161 

AT ATGGTGGATTTTGTG AT GAG ATGGAAGCGG AGCTTGT C AGAATGGG AAACGC AT AAAT 
-4160 + -+ + j. + + -4101 

CGAGAACACGTATACAGTGCTGGAAATTGGATGACTAAGCCCCAAGGGTTAGAAAAAAAA 
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TnAGACCATGTCTAGATGGAATTAGACATTTTTTGATATAATAGAAGCGGGACTTGGCGC 
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GACAATTTCAAACTTCGTCCCTAACAGGTATCGAACtTTCGAtAGTTAGCGTGTGCTACT 
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GCggAcCCCCAACCACtTGTGTTAAGCCCACATCgGTTAAGGCCCAAGGGTTAGATGAAA 
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-3 860 +— + + — — + — + +- -3801 

CTACAATTTTGACCTTGGTAACGTAAGTATGGACTGTATGGTGTGCTAAGGTGTGTTGGC 
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GTGTG AGTGTGC AAGC AAAAAAAAATGGAG AAGC C AAAAC C C GTTG AG AG AGTGCGAGAG 
-3620 +— -+ — + -- + -+- + -3561 

CATACAAGAACACCACAACAAAGTGTGAAGGAGAAAAAGAATATGAGATAAGATTTCGGA 
-3560 +— — + + + + — + -3501 
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AATACTTTTGCACACCCATGCATGGGTGTGGGTGTTTCCGTCACCGTCTATGTATTTCTC 
-3500 + + + +-- + — + -3441 

GAAATTCATGCCCACCATGGTAGATAAAAATATTTTTTTCTCTCTCCTCTTTTTATTCAA 
-3440 + — + + — + + — r — -3381 

ATCTCAAAGCAtAAkrArTGGTGACAGAACGATAAGATTCCTACCTAGCTTTCTGAGATC 
-3380 + -+----- +— + __+_ — + -3321 

CCACTAGTTTATCTTCAAGCTGGTGATTGAAGGATTAACCATGCTTGAATTAGATTGGCT 
-3320 + + — +- — __ + „_„ _+ -3261 

TCAAACTTGGTAGTAGCTTGTTTCATACTTTGATTACTTTGGTATGGTTAGTTGGTTTGA 
-3260 + — - + - + — + + -~ + -3201 

GATTTTGGTCAATGTAGAATCAGATTTGAGAGCGATTGTCAGCTTGAATTGCCGCAGTTT 
-3200 + + +— + -+ + -3141 

TAGCACATACTAGTTTGGATAGATGAACAGTTTGGAGAGACAAATAATGTCTATACGAGC 
-3140 +— -- + — + +----- +- + — -3081 

TCATCGGATAATATTAGTCTATGGCTTTTGCTTCGGTGTCCCCTCTGCAAACTTTACCCC 
-3080 +— + + -+ -3 021 

TCTGTAG ATGGT AGG ATTTT CTGAT AT C C TTT C ATGGTTT AAGGGTGTGCGTGT AAGG AA 
-3020 + -+ -+ + + +— -2961 

CGGGAGATACCGGATCACACCTTTTCGTCTACACTTTACAAGCATGTAACACCTAAGATT 
-2960 +- + — + — --+ +- + -2901 

GATTGATATCTAGGCTTACACCCCAATGGAGGTAAACTAATATTATTGAAATGCGACTTT 
-2900 +- +— + + + — — + — - -2841 

TCAAAAGTCCCAATATAACCTTGACGATGATCTTACAACTACTCGCGCCAGTCTTGTATG 
-2840 + + - + + - + + — -2781 

Kpnl 

AT AT C AG ATTGG CC G AGG AT CGTGGGTACCTTGT AGTGGACT ATG ATGCT C ATGGAGGTT 
-2780 + — + --+ -+- — + -2721 

GTATGGACATGTTGTAATGCTGGTTTTCTCTAGGTTTTTTCTAATCAACTTGGCATTCTT 
-2720 + — -+ - + +- + — — -2661 

CTCCTTAACACATAATAAGAGGGAATACCTCCATACATTATTCTGAAAAAAGCATGGCCA 
-2660 + -+ + + + -2601 

ACAATGAAACAGAAACAAGTACGACAGTCTATACCCGACCCAAACAATGGCTCAGGTCTT 
-2600 + + +- + — + -2541 

TC AC GATGC AT AGTTTGTT AGC ATGT ATTTT AT AGT AGGAACT AAAATTTAAAGAC AACT 
-2540 + + + + -- + -2481 

TGCnAAAACAATTTTGTCTCTTGAGTGTTTTTTAAGGATGCGGCATTTATCGATTATACA 
-2480 + + -+-- + - + -2421 

TT AC AT ATGTG ATTGG AT t AGCC AAC TTTTTGTCTTC C gATG ATC AT ATGAAAGGGTTGT 
-2420 + + + +- -2361 

ATCTTAGGGCATCTCCAATGGGnAGACTCAAATGCAAAAAAATnGTCCGTTTGGGTCTTC 
-2360 — + „ + + _ + __^ -2301 

CnGG AC AAAAC CTGC TC C C AACGGGG C AAC C C AACTT AAAAACGG AC AGGTGC AGCGTC C 
-2300 + + + - +- + -- + -2241 
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GGCnTGACCCAAAACTGACGCAAATTTGGnAnATTTTTGGGGCnAGCCAGACGAACGCGG 
-2240 + + +— — + + -2181 

GCGTCCACTGTATCCGACTATGTCCGCATCCTGGCCCATCTGACAGTGACACAAAATACA 
-2180 +- — + + + + -2121 

ACCACATGCGCCCCCCACCCTTCTCTCTCCTCCGTTCGCCTTTTCCCATGGAAnCnGTCC 
-2120 +- + + — -+ — + — - + -2061 

TCGCTCCTCGCCGGAATTGATCTCGCCTAACCATGCTCCGCCGCCACCcTCGcCTlcAAGG 
-2060 + + — +-- + + — — + — -2001 

CCCCAgCCGCCGCTACcTCCTTTTTGTCAGCCCTATTgGAAGTCGCCGgAGTTGAAACGA 
-2000 +- + + -+ + -1941 

GCGCCGCCAGCCTcGACACCGCCGAGCAAGACGAAGACTGGGCGGAGCTCGCCGAGACGG 
-1940 + + — + — + — - + +- -1881 

GACGGGGACGGAGCTCGCCATGCGTGCCTCGCAGGGGCGCGATGGGGGCGGAGCTCGCCG 
-1880 +- — - + -- + + + — + -1821 

PstX 

TGGCTGGCTGCAGCACCTCGGGCCGCTGCTAGCCGTGCCACGACGCGAGCATGCGCCTCG 
-1820 + — — — + + — + + — + — -1761 

ACGCCGCCCCGTGCTACCTCGTCGCGCGCCCAGGGCCGCCCCGCCCCTGCCGACCGgCGg 
-1760 +----- + + + — — +- -+ — — — -1701 

CGgAgACGCGAcCTTCGCGgACGTGCCCGGCGGCAGAGACGCGTCCTTCGCGACAGCGCC 
-1700 +-- + --^ — +- + — +- + -1641 

CTCCTCGATCTCCGTCGAGCCGCATACGCgGcTAgGAgGGACGCGGGCGTCCCCGGTGTC 
-1640 + — + — - — + — + — + + -1581 

GGCCTCCGTTGTGGCGCATCGCGGGCGCGGCCTCCGTCGAGGcGCATCGCGGGCGTGGCC 
-1580 + — -- + + — + + — + -1521 

TCGTGGCGCAGCCTGCCCTGATTCGGTCTGAGGCGCGGCGCGGAGCTTCCTCGCGGCGGC 
-1520 + — — + - -+ + + — - + — -1461 

GCGGGCGGAGCCTCCTCGCTGCGGCGCGACCTGCTCTGCCGCGGTCCGAGACGCGGCGCG 
-1460 +- +- + — + — + -1401 

GGCAGAGCTTCCTCGCGGCGGCTCGGGCGCGGCTTCCTCGCGGCGATGGCGCTTCCAGGC 
-1400 + - — -+ + — + — -+ — ■■ -1341 

TCGCACGCGGCCTCCGGCGTGGCGCAGCGAGAGCGCAGCCTCCGGTGAGTTAGGCACAGG 
-1340 + — + — -+ + + + -1281 

CGCGACACGACATCCCCGGCCTCGGCCTCCGGCGTGGCGCAGCGCGAGCGCGACGTAGCC 
-1280 + — + — — + — + + — — - + -- -1221 

T AGGTT GG C AACT AGT aC T ACG AGG AAG AAAG AGG AGAAAC AATTATTTGGGTC AC AGC G 
-1220 + — + — --+ — + + — _ + -1161 

TTGGGCGTACTGTGCGATCCAAACGGACACCCgGACGCGAaACGATGTCAGCGTGTCCGC 
-1160 +— + + + + ---+ -1101 

GTGGcGACCCAAACGACCCGAAACGGACGTCcGTTTGGGTCGGTGCGTTGGAGATGCCCT 
-1100 +- — — + + — + -- + — -+ — — -1041 

TACTCCCCATCCTCAAATGAGTCTAATTATATATCTTGTTGTAAGTTTTAAAAAAGTTAA 
-1040 + — + + + — + — + -981 



FIGURE 18 CONTINUED 



WO 01/95702 PCT/AU01/00699 



29/76 

ACTTTGATCAACATTAGTAATGATAGTAGCAACGAATACAAAATTAAATTGTAAAAATAT 
-980 + -+ — + + — + + -921 

ATTATGAAACTTTATTTTAAGATGGATCTAGTTATACTAATTTTCTGCGGATGGAGGAAG 
-920 + + — + + -+-- + -861 

TAGCTAAATATTGTTAATTTCTAAATAAAAAATTAAAACTTTAACTTAAAACAAAAGTTA 
-860 + - + -- +- + -+ +- -801 

Putative Myb Binding domains 
CAAGCATAATTATCTGtGGATGGAGGAAGtAGCTAAGATAC ACCAATCC TCTCTCTACAT 
-800 + + - + +— + +- -741 

TACCTAGC ATGC CAC ATC AGGAAACTATTT AGGATAAGCTC CAAGG AACCACCCA G AACA 
-740 +-- + + — — + + + -681 

ACAA TTTACATGGCCTGGC TAACCTAAT GACAATTTCCGAGCAACTGGTGGTGGTGGTAC 
-680 +— -+ + — — — +-— — + + — — -621 

GCGTTCCTTGTTCAATTGTCTCTATTACAAGAGTGGCCCTGTATAGGTAAAAAAAAATAA 
-620 +-- - + + + +- -561 

Hindlll PstI 
CAAGCTTCC AAGGACGGC C ATGTTC CTTGTTCCTGCAGGCTGC ACGTACTC ACGACGAAG 
' -560 + - + + — + + + -501 

TGTATCTCGTGTTCTGGACATTTGTCTCGCGCATTTTGTAACCATGAAATTAAAAATGTG 
-500 + — - + + + — — +- + -441 

GTGGCCTGCTATATCTGTATGGGGGTATCATGCACTCCTTCGCAGAGGAATCCAGACGAC 
-44 0 + + — — +- +- — + — -381 

GATTTACACGTGTTTCCACCTTAGCTTTTTTTAAGTGTGTGTGTAAGGAACGATCATATA 
-3 80 + + — -+— + + -321 

Xhol 

ACTGCCCCTGAATGCTGCATATATATAAACCGACTCCATCATGT ACTCGAG ACAAGGTCG 
-320 + + -+ -+ + — + -261 

TCAAGAAAAACAAACTATGCCTATCTCACTAGCAATGATTTGAGAGTACAGCTTTTCCGG 
-260 +- + + + +~- + -201 

TGCCATATTTTTTCCTATATATCTTTTTCTGAAGAACAAGAAAAAAAAAACAGTTGGTGT 
-200 + + — +-- — — -+ --+ -141 

GGTGGTTGGTGAAGCGAGAAAGCCCCATATAAGCCCTGCTCACCCTCCCCGCAAAGCACA 
-140 + - + +- + + — - + -81 

PvuJ 

ACTCATAGCTCGGGTCTCTCGCTCACACCAAAATCGCCCACCAGCACCAGCATCTCTCGA 
-80 + - + + — +-- + - - + -21 

TCGGCAGACGCATAGATCGATGGGCTCCACCGCCGCCGACATGGCCGCGTCCGCGGACGA 

-20 + + +— -+ + 39 

MGSTAADMAASADE 

GGACGCGTGCATGTTCGCCCTCCAGCTCGCTTCCTCGTCGGTCCTCCCGATGACGCTGAA 

40 + + + + +~ + — — 99 

DACMFAL QLASSSVLPMTLK 

GAACGCCATCGAGCTTGGCCTCCTGGAGATCCTGGTGGCCGCCGGCGGCAAGTCGCTGAC 

100 + — + + +— + + 159 

NAIELGLLEILVAAGGKSLT 

CCCGACCGAGGTGGCCGCCAAGCTCCCGTCCGCGGCGAACCCGGAAGCGCCGGACATGGT 

160 + — + -+ — -+- --+-- + 219 

PTEVAAKLPSAANPEAPDMV 
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GGACCGCATACTCCGGCTGCTCGCGTCGTACAACGTCGTGACGTGCCTGGTGGAGGAGGG 

220 + + + +— + - + 279 

DRI LRLLASYNVVTCLVEEG 

CAAGGACGGCCGCCTCTCCCGGAGCTACGGCGCCGCGCCCGTGTGCAAGTTCCTCACCCC 

280 + — + — + + + + 339 

KDGRLSRSYGAAPVCKFLTP 

CAACGAGGACGGCGTCTCCATGGCGGCGCTCGCGCTCATGAACCAGGACAAGGTCCTCAT 

340 + + + — - +- + + 399 

NEDGVSMAALALMNQDKVLM 

Intron/exon boundaary 
GGAGAG CTGiGTGAG TCTCTCAGTGGAGCTAGTTACTGTAGATCCGAATTCGTTCCCTTTA 

40 0 +- + -+ + + 459 

E S 

Sail pBluescript 
GTGAGGGTTAATTCCGCGGCCGCGTCGACCTCGAGGGGGGGCCCGGTACCCAATTCGCCC 
460 +- - + — — - + + 

TATAGTGAGTCGTATTACGCGCGCTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAA 
AACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGT 
AATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAA 
TGGG AC GC GC CCTGT AGCGGC GC ATT AAGCGC GGCGGGTGTTGTG 744 
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FIGURE 19 




Ryegrass OMT gusA CaMV35S 

Promoter polyA 
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TCCCGTATCTTCAACGTGACACCCCTACACTTCCTGCTTGTCTTGGAGATTTACACACAC 
1 + + + + + + 60 

ACGGCAATTACCAGGAGTATCTTCCTAGATTATTTTTTTCGATAAGGATCTTCCAGATAT 
61 + + + + + + 120 

AGCATGTGAATCTCTGTACTACTACTGTTTGTCAAGCAAAATTAACATTGACATCAGTGT 
121 + + + + + + 180 

TTTTGTTGGGGGCAGCGGAATCTTTGACGCCTCTTCTTGCCTCTCAAGACATGTCACCCT 
181 + + + + + + 240 

CACTAGTTAGTGTGCCAGCTGGTAGTACTACGTACGATGCTCCCTCCCTCCGTAATTATT 
241 + + + + + + 300 

CAACCTTTTTGCTCTCTCTTTTTATAAAGTCAAACCTTTTAAATCTGACCAGATATCTGC 
301 + + — + + + ~ + 3 60 

TAAAAAATTAGCAGACATGCATACATCAAAGCAGTAGTCCTCCCTCCGTTTAAAATTACC 
3 61 + + + + + + 420 

TGG G TTTATTC AAATAAAGTC AAAC TC TGTAAAATTC AATTAAATATTTAG AAA2VAT C TA 
421 + + + + + + 480 

ACAGCACCTGTAGTATAAAAGTATGCTCCCTCTGTTTGTAAAAAAGCTAAGCAACTTTTT 
481 + + + + + + 540 

TGAG ATAC G G AT AAAT C TTTAG C TAAAACATG TC TATATAC C TTTGTATC TAGATAAAGT 
541 + + + + + + 600 

TG G AAAG C TTTTT TAG AAAC AG AC AAAGTATGTGTTTG AC ATTATGAATG TTG AG TATTT 
601 + + + + + + 660 

TTC CTC TAATCTTGATCAAATTTTACAAATTTTGGCTTGAATAGAGGGAC C ATT AT TAGT 
661 + + + + + + 720 

ATGAAACTACATAAATTTGTAAAACACTCAACATAATTTACGATGGGTCAGTGATAGCAC 
721 + + + + + + 780 

TAACTTAGCTTTTCATAAATGCCACTGCTTTTCAATAGAGCATGAAGCAGGACAAATTTA 
781 + + + + + — + 840 

TTCGTGTGACTTGAATAGAGGGAGCCTGTTCTGGTTCAACTCACCCTGCATGTGTGTCTT 
841 + + + + + + 900 

CATCCCTTTTGCTCTTCCTATCTGTGGTGTCAATTGAGTGTCCCACGTGCATGTGGGCGA 
901 + + — + + + — + 960 
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AACTTGAACCTAGAAATTGACATGCTCCCACTGCCCGGAGCGGAGTATCTTTGTGCTTTG 



961 + + + + + + 1020 

TTAC C C TTATTG TTG C TAC G TAC TAC AGTG TTTAG ATTGG AAC TT C ATAATC AAAAG AAC 
1021 + + + + + + 1080 

TTAGTTTC C TAC AATTTT TTG C TAAG C AATATAATG AG C AATC AAAC TTC TATATC TGTG 
1081 + + + + + + 1140 

G C AAATAAC TAAT C CATT ATAGTTAC AG TTTAG ATG C AG AC G C C AGTGTTTC TTC C C CTT 
1141 + + + + + + 1200 

TTCGGAAAAAAGCTATTCCATAATAAGTGTTGGAAATTTAATAAATGGGTACTACGAATT 
1201 + + + + + + 1260 

TG AAAAAAAAAGTG T C AAAAATTC AC TAAGAAAGTAC GTAGTAC AAATTTAAAC TAAGAT 
1261 + + + + + + 1320 

TC CG AC AC TTATTAGG ATC G GAG AG AG TAAG TAG C AAAC TAC TAC TC CAT C C ACC TAAAA 
1321 + + + + + + 13 80 

CACGTGATTTAACTTTGTCTAGATAC GGATAGAAAGTTGGGATACATC CGTATCTTAAAA 
1381 + + + + + + 1440 

AAAAAC G C ACTTATTTTAGAC G AAG G AGG GAG TATTTC AAC CTTGATTTT AAAC GG AATC 
1441 + + + + + + 1500 

TAC AAAGGGAATACATGGATTGTAC AAGTGGGCTGAC CGTATC CATTATGTACTCGTACT 
1501 + + + + + + 1560 

TTGCAGTTTGAAAGCAAAGGCTAGTGTAATTTGTAGGTGGTTCTAGGCGTCTAGCTGTTT 
1561 + + + + + + 1620 

CATGGCGTTATCACAGCCGTGCCAGTGTGCTCAGGGCCGTACATAAGTTGCTTGGTGTAT 
1621 +- + +- + + + 1680 

GTGTCGATCTAGGATTTGCCGTCTTACAATTTTGCTTTCCAACTTATTTTCTGTAAAGAG 
1681 + + + + + + 1740 

ATC GATGTGAAC TTCT CTGTCG AG TAAAC TG AAATTGTC TG AATAAATATAACTCGG CAG 
1741 + + + + + + 1800 

ATT ATG TTTTATC GTTTG C ATGC G TAAC AGG C TACACAAATTG C TC GAG T CAG CAG C GAG 
1801 + + + + + + 1860 

TTG AG C TC AC AAC GAAT C CATC AG C AAAAATAC TATAC TATAGTAG C AC ATC GTTTC TTT 
1861 + + + + + 1920 
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TTTCATGAC GTTTCTGTTTCTTC CTAACTTTC CAGGAG CACC GGAGACG ACGATGTGGTG 

1921 + + + + + + 1980 

RSTGDDDVV 

ATAAAGATTTTGTACTGCGGAATCTGCCACTCTGACCTGCACGCCCTGAAGAACGACTGG 

1981 + + + + + + 2040 

IKILYCGICHSDLHALKNDW 

AAG AAC TC AAGG TAC C C G ATG ATC C C CG GG C AC GAG AT CG C C G G C GAG GT C AC G G AGGTG 

2041 + +— + + + + 2100 

KNSRYPMIPGHEIAGEVTEV 

GGCAAGAACGTGAGCAAGTTCAAGGCCGGCGACCGCGTGGGCGTCGGGTGCATGGTGAAC 

2101 + + + + -+ + 2160 

G KNVSKFKAGDRVGVG CMVN 

TCGTGCCGGTCGTGCGAGAGCTGCGACAAGGGCTTCGAGAACCACTGCCCGGGCATGATC 

2161 + + — + -+ + + 2220 

SCRSCESCDKGFENHCPGMI 

CTCACCTACAACTCGGTCGACGTCGACGGCACCGTCACCTACGGCGGCTACTCCAGCATG 

2221 + + + + + + 2280 

LTYNSVDVDGTVTYGGYSSM 

GTGGTGGTGCACGAGCGGTTCGTGGTCCGGTTCCCCGACGCCATGCCGCTGGACAAGGGC 

2281 + + + + + + 2340 

VVVHERFVVRF PDAMPLDKG 



GCGCCGCTGCTGTGCGCCGGCATCACCGTGTACAGCCCCATGAAGTACCACGGGCTCAAC 

2341 — + + — + + + + 2400 

APLLCAG ITVYS PMKYHGLN 



GTTCCCGGGCTGCACCTCGGCGTGCTGGGGCTGGGCGGGCTGGGCCACGTTGCGGTCAAG 

2401 + + + + + + 2460 

VPGLHLGVLGLGGLGHVAVK 



TTCGGCAAGGCCTTCGGAATGAAAGTGACGGTGATCAGCTCGTCGCCGGGGAAGAAGGAG 

2461 + + + + + + 2520 

FGKAFGMKVTVISSSPGKKE 



GAGGCCCTGGGGCGGCTGGGCGCCGACGCGTTCATCGTCAGCAAGGACGCCGACGAGATG 

2521 + + + + + + 2580 

EALGRLGADAF IVSKDADEM 



AAGGTAGGCGGACCCGCTGGTTCAGGTTACTTCCCCTGTCCGGTGCAGAAGAAAGAGGAA 
2581 + + + + + + 2640 



K 
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G at 851 bp (coding sequence) missing from cDNA in cv Ellefcfc 

▼ 

CTTGAGGGTTCATGTTTGTTTTGCGTTGGTGATGTCTTTGCAGGCTGTGATGAGCACCAT 

2641 + + + + + ' + 2700 

A V M S T M 

GGATGGCATCATAAACACGGTATCTGCAAACATCCCCCTGACCCCTCTCTTCGGGCTGCT 

2701 + + + + + + 2760 

DGI INTVSANIPLTPLFG'LIi 

CAAGCCCAACGGCAAGATGATCATGGTCGGCCTCCCCGAGAAGCCCATCGAGATTCCTCC 

2761 + + + + + + 2820 

KPNGKMIMVGLPEKPIEIPP 

C TTC G CTC T AG TTG C C AG TAAGT CTTAG GATC TC TTG C AAT AAG GAG AAATC ATGC AC TG 

2821 + + + + + + 2880 

F A L V A 

ATC G ATC AG AG AAATG AGATAG CAT C C TG ATG AAC ATTG TACGTGTG TG C AG CG AAT AAG 

2881 + + + + + + 2940 

N K 

ACCCTGGCCGGGAGCATCATCGGCGGCATGAGCGACACGCAGGAGATGCTGGACCTCGCG 

2941 + + + + + + 3000 

TLAGS I IGGMSDT'QEMLDLiA 

GCGAAGCACGGCGTGACGGCCGACATCGAGGTGGTCGGCGCGGAGTATGTGAACACGGCC 

3001 + + + -+ + + 3060 

AKHGVTAD I EVVGAEYVNTA 

TTGGAGCGCCTTGCCAAGAACGACGTCAGGTATCGCTTCGTCATCGACATCGGCAACACC 

3061 + + + + + + 3120 

LERLAKNDVRYRFVIDIGNT 



CTCG AC AAG GTTG CGG CCAC C ACC G AG TG AACGTACTCAG CAC TG C TTAC GAT CTACG TT 

3121 + + + + + + 3180 

LDKVAATTE * 



GTTCCACTGTTAGTGCTCCGTAGTAAACAATAAACGATCAAAACTCTTGTCATCTGGTGC 
3181 + + + + + + 3240 



ATTGGTGTAGACATGGTTGTTTGCGAGGAAACTGAGTTGAAGGATGGATGGATAAGTTTG 
3241 + + + + + + 3300 



CTTCTTGCCGTGTTAATGGATTACCTACTTAGCTTCACTGCAATTAACAAATTAAGAAAC 
3301 + + + + + + 3360 



GACACACCCAAAAGACTTTCGTCAGTTTTCTTGGATTATACAAGTCGTTATGGTTGGGTG 
3361 + + + + + + 3420 
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TCAGTGTGTCACAGATAATCATACTATGGTATTTAACCTGGAAGATCGTTTTTTTGGCGG 
3421 + + + + + + 3480 

C AACTCAGTGG GTTTTC C CACTATGTATATTTATAAATATTC AAC AAGTCATGAGGTACA 
3481 + + + + + + 3540 

AAGGGTTGTTGCTAGAGGATAGCAACAAGAAGCTAGCCAAAAGATCATAGGCTTAAAAAA 
3541 + + + + + + 3600 

GAG AG AAAAG AAAAC AAAAC TG C TATAGTTATC G AAATC TC TC AG C TC AAATTTTAAAAC 
3 601 + + + + + + 3 660 

CAGCATAAGACTTTCTAGAAGCCTTATGAACAAGAAGAGCTAGCTCATCTTTAAACCTTT 
3 661 + + + + + + 3 720 

TCCTGCATCTGTAAAGATTGAGGGTGCAACCCTTGAATATAAAATCATTCCTGTCATCCA 
3721 + + + + + + 3780 

GATAGACTATGTAGTCAAAATAGTCATTTCCATGAAGAAGGGCACTTTTAATACATTTTT 
3781 + + + + + + 3840 

GAGACTTGGTATG ATAC T CTGAATGTC AAC AC C C TGGAAG ATC TTTTC AC TC C TATGGAA 
3841 + + + + + + 3900 

GGAC AAGAAAG C ATTTC AAC T C C TTTTAC TAAGGAAGAGATTGAC AAGGTGATTCAGAGA 
3901 + +- + + + + 3960 

ATTCCTTTAGACACTATAGAAAGTCACAAGGTGCCAACGGCGCAATCCTGTGCCGACGGC 
3961 + + + + + + 4020 

TTTTTATCGGGGAAGCCAGCATCGGTACCGAGACCGGCAGCCCACCAACTAGGCCGTCGG 
4021 + + + + + + 4080 

CACACATCCTCCAGTGTCGGCGGCCAACATCGGCATAAGTTGGCCCGTTGGGCATCAACT 
4081 + + + + + + 4140 

CCCCCGTCGGAACAGGTCTAGCGCATGGACCGTCGTGATGGCGGCGGCAACGACGTCATC 
4141 + — * + + + + + 4200 

CTATGCCGACGGCCTAGCCGTCGGCCTAGCTTGCCAGCGCTATGCCGACGTCACATTGCC 
4201 + + + + + + 4260 

ATCGGCACATGCTAGTTTTTTTTTCTTTTTTCTACATGCCAAATTGTATATGTATATATA 
4261 + + + + + -+ 4320 

C T C ATTTAC TTATTAC TTC C AATTATTTTAATGT GTATATATTTTGCTC AC CAATTGTAC 
4321 + + + + + + 4380 



FIGURE 26 CONTINUED 



WO 01/95702 



PCT/AU01/00699 



43/76 

GAATTTGT AC C C TC C GAGAAATTGC TAAAATGATGGAGTGAC C T AC AAC GAGC C TT GGAT 
43 81 + + + + + + 4440 



ATGTGAGTTCTTCTTGCCCCATTGCACAAA2U\TTGTAAATATTAGGGTTTACTGGATCCA 
4441 + + + + + + 4500 



CTAGTTCTAGAGCGGCCGCCACCGCGGGGAGCTCCAGCTTTTGTTCCCTTTAGTA 
A) + + + + + 4555 
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GGCACGAGTCGCCTCCAACGTCTTCCCTTAACCGGCCGTCCCTACGCtTGCACCACCACC 
1 + + + + + — — + 60 

ACGCACAGACAGAGCAGTTTCCCAGCCCCCGCCGGAACCGGATGGCACCCACGGCGGCGG 

61 + + + + + + 120 

M A P T A A E 

AGC AG ACGG AG C AC C ACC AG CACACC AGG AAGGCGGTGGGGCTGG CGG CG CG CG ACG AC G 

121 + + + + + + 180 

QTEHHQHTRKAVGLAARDDA 

CCGGC CACCT C TCC C CG CT CG CC ATCAC ACGGAGG AGC ACAGGAG ACG ACG ATGTGG TG A 

181 + + + + + + 240 

GHLSPLAITRRSTGDDDVVI 

TAAAGATTTTGTACTGCGGAATCTGCCACTCTGACCTGCACGCCCTGAAGAACGACTGGA 

241 — + + + + + + 300 

KILYCG I CHSDLtHALKNDWK 

AGAACTCAAGGTACCCGATGATCCCCGGGCACGAGATCGCCGGCGAGGTCACGGAGGTGG 

301 + + + + + + 360 

NSRYPMI PGHEIAGEVTEVG 

GCAAGAACGTGAGCAAGTTCAAGGCCGGCGACCGCGTGGGCGTCGGGTGCATGGTGAACT 

361 + + + + + + 420 

KNVSKFKAGDRVGVGCMVNS 

CGTGCCGGTCGTGCGAGAGCTGCGACAAGGGCTTCGAGAACCACTGCCCGGGCATGATCC 

421 + + + -+ — — + + 480 

CRSCESCDKGFENHCPGMIL 

TCACCTACAACTCGGTCG ACG TCGACGGCACCGT CACCT ACGGCGGCTACTCC AG CATGG 

481 +- + + + + + 540 

TYNSVDVDGTVTYGGYSSMV 

TGGTGGTGCACGAGCGGTTCGTGGTCCGGTTCCCCGACGCCATGCCGCTGGACAAGGGCG 

541 + + — + + + + 600 

VVHERFVVRF PDAMPLDKGA 

CGCCGCTGCTGTGCGCCGGCATCACCGTGTACAGCCCCATGAAGTACCACGGGCTCAACG 

601 + + — — + — + + 660 

PLLCAGITVYSPMKYHGLNV 

TT CCCGGG CTG CAC CT CGG CG TG CTGGGG CTGGGCGGG CTGGG CC ACGTTG CGGT C AAGT 

661 + — + + +n + + 720 

PGLHLGVLGLGGLGHVAVKF 

TCGGCAAGGCCTTCGGAATGAAAGTGACGGTGATCAGCTCGTCGCCGGGGAAGAAGGAGG 

721 - — + + + + + + 780 

GKAFGMKVTVISSSPGKKEE 
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AGGCCCTGGGGCGGCTGGGCGCCGACGCGTTCATCGTCAGCAAGGACGCCGACGAGATGA 

781 + — + + + + + 840 

ALGRLGADAFIVSKDADEMK 



6 at 851 bp (coding sequence) missing from cDNA in cv Ellett 

▼ 

AGGCTG TG AT GAG CACC ATGGATGG CAT CATAAAC ACGGTATCTG C AAAC ATCCC CCTG A 

841 + + + + + ~ + 900 

AVMS TMDG I INTVSANI PLT 

CCCCTCTCTTCGGGCTGCTCAAGCCCAACGGCAAGATGATCATGGTCGGCCTCCCCGAGA 

901 + + + + + + 960 

PLFGLLKPNGKMIMVGIiPEK 

AG CC CATCG AGATTC CTCC CTT CG CTCTAGTTGC CACG AATAAGAC C CTGG C CGGG AG C A 

961 + + + + + + 1020 

PIEI PPFALVATNKTLAGS I 

TCATCGG CGG C ATG AG CGACACG CAGGAGATG CTGG AC CT CGCGG CG AAG C ACGGCGTGA 

1021 — — — + + + + + + 1080 

IGGMS DTQEMLDLAAKHGVT 

CGGCCGACATCGAGGTGGTCGGCGCGGAGTATGTGAACACGGCCTTGGAGCGCCTTGCCA 

1081 + + + + + + 1140 

ADIEVVGAEYVNTALERLAK 

AGAACG ACGT CAGG TATCG CTTCGTCAT CGACAT CGG CAACACC CT CGAC AATGTTG CGG 

1141 + + + + + + 1200 

NDVRYRFVID IGNTLDNVAA 

CCACCACCGAGTGAACGTACTCAGCACTGCTTACGATCTACGTTGTTCCACTGTTAGTGC 

1201 -+ — + + + — + 1260 

T T E * 

TCCGTAGTAAACAATAAACGATCAAAACTCTTGTCATCTGGTGCATTGGTGTAGACATGG 
1261 + + + + + + 1320 

TTGTTTGCGAGGAAACTGAGTTGAAGGATGGATGGATAAAAAAAAAAAAAAAAAAAAA 
A) + + ~ + + + 1378 

GG CACG AGTCG C CT C C AACGT CTT C CCTTAACCGG CCGT CCCTACG C fcTG CAC CACCAC C 
1 + — + + + + 60 

ACGCACAGACAGAGCAGTTTCCCAGCCCCCGCCGGAACCGGATGGCACCCACGGCGGCGG 

61 + + + + + + 120 

M A P T A A E 

AGCAGACGGAGCACCACCAGCACACCAGGAAGGCGGTGGGGCTGGCGGCGCGCGACGACG 

121 + + + + + + 180 

QTEHHQHTRKAVGLAARDDA 
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C CG G CC AC CTCTC CC CG CTCGC CATCACACGGAGGAG C AC AGG AG ACGACGATG TGGTGA 

181 -+ + + + + + 240 

GHLSPLAITRRSTGDDDVVI 



TAAAGATTT TGTACTG CGG AATCTG C CACT CTGACCTG CACG C C CTG AAG AACG ACTGGA 

241 +- + + +- — + + 300 

KILYCGICHSDLHALKNDWK 



AGAACTCAAGGTACCCGATGATCCCCGGGCACGAGATCGCCGGCGAGGTCACGGAGGTGG 

301 + + + + + + 360 

NSRYPMIPGHEIAGEV TEVG 



GCAAGAACGTGAGCAAGTTCAAGGCCGGCGACCGCGTGGGCGTCGGGTGCATGGTGAACT 

361 + + + + + + 420 

KNVSKFKAGDRVGVGCMVNS 



CGTGCCGGTCGTGCGAGAGCTGCGACAAGGGCTTCGAGAACCACTGCCCGGGCATGATCC 

421 + — + — +- — + + — + 480 

CRSCESCDKGFENHCPGMIL 



TCACCTACAACTCGGTCGACGTCGACGGCACCGTCACCTACGGCGGCTACTCCAGCATGG 

481 + + + + + + 540 

TYNSVDVDGTVTYGGYSSMV 



TGGTGGTGCACGAGCGGTTCGTGGTCCGGTTCCCCGACGCCATGCCGCTGGACAAGGGCG 

541 + + + + + + 600 

VVHERFVVRF PDAMPLDKGA 



CGCCGCTGCTGTGCGCCGGCATCACCGTGTACAGCCCCATGAAGTACCACGGGCTCAACG 

601 + + + + + + 660 

PLLCAG ITVYSPMKYHGLNV 



TTC CCGGG C TG CACCT CGG CG TG CTGGGGCTGGGCGGG CTGGG C GACGTTG CGGT C AAGT 

661 + + -+ — + + + 720 

PGLHLGVLGLGGLGHVAVKF 



TCGGCAAGG CCTTCGG AATGAAAGTGACGGTGATCAG CTCGTCGCCGGGGAAGAAGGAGG 

721 + +- + + + + 780 

GKAFGMKVTVI S SS PGKKEE 



AG G C CCTG GGGCGG CTGGG CG C CGACG CGTT CAT CGTC AG C AAGGACG C CGACGAG ATG A 

781 + + + + + + 840 

ALGRLGADAFIVSKDADEMK 

6 missing at 851 bp in the cDNA isolated from cv Ellett 
resulted in a premature stop codon (truncated CAD2) 



AG G CTGTG ATAG CACC ATGG ATGG CATCATAAACACGGTAT CTG CAAAC ATCC CC CTGAC 

841 + + + + + + 900 

AVIAPWMAS* 
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CCCTCTCTTCGGGCTGCTCAAGCCCAACGGCAAGATGATCATGGTCGGCCTCCCCGAGAA 
901 + + — — + + + -+ 960 

GCCCATCGAGATTCCTCCCTTCGCTCTAGTTGCCACGAATAAGACCCTGGCCGGGAGCAT 
961 + + + + + + 1020 

CAT CGG CGG CATGAG CGACACGC AGGAG ATGCTGG ACCTCG CGG CG AAG C ACGG CGTG AC 
1021 + + + + + + 1080 

GGCCGACATCGAGGTGGTCGGCGCGGAGTATGTGAACACGGCCTTGGAGCGCCTTGCCAA 
1081 + + + + + + 1140 

GAACGACGTCAGGTATCGCTTCGTCATCGACATCGGCAACACCCTCGACAATGTTGCGGC 
1141 + + + + + + 1200 

C AC CAC CGAGTGAACGT ACTCAG CACTG CTTACGATC TACGTTGTT CC ACTGTTAGTG CT 
1201 + — + — + + + + 1260 

CCGTAGTAAACAATAAACGATCAAAACTCTTGTCATCTGGTGCATTGGTGTAGACATGGT 
1261 — + + + + . + + 1320 

TGTTTGCGAGGAAACTGAGTTGAAGGATGGATGGATAAAAAAAAAAAAAAAAAAAAA 
A) + + + + + 1377 
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FIGURE 28 
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FIGURE 29 A 
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FIGURE 29B 
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FIGURE 30 
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FIGURE 31 
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FIGURE 32 
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FIGURE 36 
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CGGGAT CAAC T T GGAT G T CC T T TG CGGGC AC GGTTT C AGGAAC AACGAC AC ATGC AGC AG 
-2206 + + — + + + " + -2147 

GGATCTCCTCCAAAGACTCACACAAAGGTGACATGAGCGCCCGCTTTTTTGAAGCCAAGT 
-2146 + + + + + + -2087 

TGGCTAAGAAATCGCAAAGCTTGGTGGAGTCGGCCACCTCAGGATCTGCAACAAAAGGCA 
-2086 + + + + + + -2027 

CCAAGGGAGCTGCCAACACATCAACCACAACATCATGTTCAAACGCAGTCTCCTCAAGCC 
-2026 -+ -+ + + + + -1967 

TCGAAT GC TC AAC C G AAAGAGAGGC AGAAGCT TC AAC AAAAAACT CAGC C AACC CAAAGC 
-1966 — + + — + + + + -1907 

CCTCGACGTCATCAGAGATTAGGCTCTGAGGACCCGCAGGGAAGCAACCTTGTCAACAAC 
-1906 + + + + + + -1847 

CGCATCCGGCAGAAAAGGAGCAAGACCGGAGCAACCCTCAAGAGGCACACGAAAGACGTC 
-1846 + + + + + + -1787 

GAAGC C AAG AG GAGAC G AGT CG C AGGGACGGCGGAC AGGC GAGAAGGGGC CGTAGAACT C 
-1786 + + + + + + -1727 

CAAGAGCTCGGCGTCCCTCGACCTAGCATCCGAAGCACTGACCGGGGCACTCAATGCATA 
-1726 + + + + + + -1667 

ACTTTATCTTGATGGCATATGTACTCAAACCCATACAATGTTCACCATGCATTATCTATG 
-1666 + + + + + -t -1607 

GAACATTCCTT CATATACAACT t CTGAGTGGTCAGTGCATAGGAATTTTCATTAACAACC 
-1606 + + + +-' + + -1547 

AAAAACATACTTGGGGCCTACACACACTTTCACAGCATGGAAAACTTGTTAGCTTTTTAA 
-1546 + + + + + + -1487 

AGAGTTGCAAAATCTGTCAAGCGAATGTTCTTGTGATAATTGGAACGAAGCATGTTTCCC 
-1486 + + + +- + + -1427 

CATTTTCAATGTGTGTCTCTTACCCTAACTAGCACCCGACCAACAAAATCTGACCATCCT 
-1426 + + + + + + -1367 
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AGTTAT AT CAT C AT AG AG ACC C AC ATGT AGGT TGAC C C C C AT AAC ACTT GTGTGGAT ATC 
-1366 + + + + + + -1307 

ATGGAAAATGGCCTTGATCAACACTTTCTTTCCTACTTGGTACAAATGGTTATGGACTTA 
-1306 + + + + + + -1247 

CTCAATTAGTGCTTTAGAGAGCTTTGGCTGCAGACTTTGTAGCTTCCCAATATTCATAGG 
-1246 + + + + + + -1187 

TCCCTCCGGAGTGGGCAGCCCCATCTACATAGGCTCAAAACCAGATTTTTGTAACATGTT 
-1186 + — + + + + + -1127 

AGACAC T T TC AAC T TC AT C AT AGAC CATC AAGGAGC T GGC AT GT GACAGT GATAT ATGT A 
-1126 + + + + + + -1067 

TCAATTACCCATT CAACACGAATAGCTTGCTCATGCATGGTTAGTCTTGCGGC GGCGGGG 
-1066 + + + + + + -1007 

C GGGAC CAT C GAAC AC AC CGC C GGGCGGTC AGT AGGC TAGGG TT AGAT AAAATC T AGCCG 
-1006 + + + + + + -947 

TTTTCATTCAAACTTGTGATATATAATCAAATTTAAATAAAAACCTTTATTTTCGTGCAT 
-946 — + + + + + + -887 

T TTTAT T TAT T T G AGGGC GTGT T TGGGGGAC AC GGC TGGAAAGT GAC ATC C CC AAACACT 
-886 + + + -+ + + -827 

GCACGAAGAAAACGCGTCGCCAAAAAATTCGATCCGGCGTCAGTCCTTTGGGAGACGATT 
-826 + + + + + + -767 

TGGAT GACGC GGC TAG AGAT GC T CT AAGT TCT C C ACGC C AT GTT T CTT TC T ATAT ATACA 
-766 + — + + + + + -707 

C ACAGC C C AAGGT C CAT GAAAAGT AAAACGGC ACGAC GACAC GC AC CGGC GACAACTTC A 
-706 + + + + + + -647 

CATTACGGCACATCGCTATTACGGACCACATACAACTCCACCGCTATTCTCAGCCAAGTC 
-646 + + + + + + -587 

ATACATGACATGATCCAATGGACGACTTTGTGAGCGAAACTAGAACCTTGCGGGGTTTAG 
-586 + + + + — + + -527 

ATTTTCCAATGTGGATAAGTTGTACGCGCCGACTAGCTTTACACTTGGTTGAAAAAAGCT 
-526 + + + + + + -467 
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atactcctactgtatattggtaaaacaaaacattttcttttttatttgataggagtgctgcaaattaaagttctttgtgtca 
tttttcaaaggaaaaaaaaaacacctttaccactcttcttccttgccatcattttttttttaccaaagtttgttctgtcaaa 
tgaacatatatatagttcggtgctatgtcagtgccatttaccggccactagctagtaggactgccatgttccagcaaattgt 
c t ag t ggac cggag t ggcc aaaaggagccaa 1 1 a t g t agg 

gttgcaagcgggatcacacaaaagcctcgcctctagttcattttatcaattaagtggtactttctcagggaccccccttgca 
actctaccattacatccgtgcaaaataaaagctagcatcacgcaccagatttagtactccctccgtttttatttagttcgca 
ttctaggttcagccaaagtcatactttgcaaagtttaaccaaaattataagaaaaaaatatcaataatcatcatacaaaata 
catataatataagagtaaaccttataacgattctacaatagatttttttattgcatatgtcaatattttttcataaatattt 
actcaaaattataaggtttgactttgactaaacccagaaccttcttagagaggaagaaatgcatgggcaaaagcaaatcatg 
catatgggcaggagtaacatttttttgactttcatagaaagtactgtatggcactaaacggtctaaaccggacactggaagc 
aaatcgtgcacgtgggcaatattatctaccgtcgcgtcgccagtctccccatgcccatgaccatgcttggaattttagtctc 
gccggagctgccgagtgcatgcatagtgacgagtttcaataggccactatatatgtgatcatggctcttgatttgtcacttt 
ctttttttgccgaaggatatagtagtattactttctctgctatcacaaagaaagaactgattgtgtctagtctaggtggtct 
cagaattctgcatgactccagagtattcttgatgccacttgtttgttattgcaagaaacttaattcggagacaaccaaaagc 
tcatcccatgtctctggaactagtagacataagaaaatctcatggtatcagtttgctatttatctacaactgaaacggcatg 
tttggttttattaaattcagaactggtactgctacgggaaggcggttgcggagcaggcggcatcggagttggcgcggcagcg 
cggcgtggaccttgtggtggtgaacccggtgctggtgatc 

ggccccctgctgcagccgacggtgaacgccagcatcggccacatcctcaagtacctggacgggtcggccagcaagttcgcca 
acgccgtgcaggcgtacgtggacgtccgcgacgtggccgacgcccacctccgcgtcttcgagtgcgccgccgcgtccggccg 
ccacctctgcgccgagcgcgtcctccaccgcgaggacgtcgtgcgcatcctcgccaagctcttccccgagtaccccgtcccc 
accaggtacgcgtacgacctgcttgctagccgcttccgttaattccattgccttaattgattgcatgatgccgctcctaatt 
tactcacttgcgtaactaattgcattcatatatgatctaccaaccgtggagaaaattagcaagagtctgtcggggcgtcccg 
gtccagtgcagttaacctgcatgtcgatggtctgcaggttgcagcttacttgtggttctttagttcagagacacagagcaat 
tgggcactaagcaaaactgacatcactggtaattaggtagctcccacacactgaagtgggtggatcccatcggtagtaggta 
agggtggatagtactggacgagagctcgatcgttgttgtaaaaaagcgagtgaccaccacttcaccatccactgcaagtagc 
tgctagtgaaccatccaaccagctccctggatcactctgctccgtccgtaccttcagctacctacagaagcgacatgaacac 
acagacacacaaggccggctcaccattcgcataggtcaaaccaaatgttggtgaacggcaacatcgccacaagtcgcgtgct 
agttcgaggttgtgtccggtgtaccgaggccacactattcgtgctgcccgtcgctgatatttgcacgcgtagctgtcgacga 
aagtaggtggactgacagatacacatatcctcattgccttctctgctcggtttctgctaggattgccatcttcaggagtgcc 
tatccgcacggcagaaacgcgtagcatcaggccagaaagc 

agcgtgcgtgatatcgtaacccagacggtcttcacctgtccattctgggctacctggcatactacctcggtgccgctgtgcc 
gctgaccaattcgtgcacgaccactatagcaaaaccctatgcatgtaactgcttcaagatcagcagtgacatgtgcaatata 
aacctcaagtgtgcactctagtgcgtactgataaaaccgtataactggtgacccagtcattcttctcttttttatttgtttg 
gaccaaacgaacacagcatgttatccatcaccaacaagtggcgctgatttttcaaactacactgggatcatactggaaacca 
aagcaggagaacatcttcgaaccaagagatgtttactaaatttgaaagaaaatgtactgacaagtaatctgtctgaagcaag 
acacatactacctcggttcgaacgtgggacaccatgcccgtgccatatttgctaggcaccactctgccgtcgattgtatccc 
aacggagggagtatcgatttgcgcaaagttcctacatacatagccgctcaagatataatcttacgaccttccgtcgaaatcg 
gtgatacgtcgcaacctatagctaacttggcagagcataaaataactatctaaggttggggtctccctcttttcaatcaacc 
tttcataccgaatgatgggagtgtttgtgaaaacatctcttggtcgactcagcattagcgccctaccaatttctctgtggac 
aatgccaccttaaatcgttttttagtcttcatgatttactcccccttatatctggccgtagtccctcttttccatttttctt 
gtctggttttaagtcaaatttagactactaaaacaacagcaagattttatggaagggaggtagtgcaaaacagaaagtccga 
tcgaaatgcgtgccaatttgtcgtcgcggcggccggactaaaatggatctgcatgtgcataccgttcgtcggagtatcctgc 
gaacggtcgtgtgtttagtcaacattaatgtgaggttcat 

gtgatactcttgcttgaaagatactactactgctacctcgtagaactgaatgaaagtatgtgggactgttcagctctctgca 

catgtcaaatgtcgttactcatacctttcgtcagagcatcctgcgacgcgcgccggtgccgaaatttcgccgtgtgtttagt 

caagatcaacgtgaggttcatgcggtaccctatctggcttcgaagataccaagcagactgcggctagattgtcattttgatg 

tcgcaatcttcaccaaacctgcccttccggaccacagcagcagtacgtaacaatggtgtcatcgccatgcgttgctcgtgtc 

caaggaaacggaggaatctcggcttcccacaagtcacgcatcgatgttcacacctgaattggtcgacgtttcttcttctaga 

ctagaaaaagattacagaacaacgcaagcttcgttcaagtccatacttctgttcagtatactcctgatgattgcagttatat 

cagcatgtctattctgaatttttgcacttctattcaaaggatgggctggaattgctactgactttggtgtgatgtgtgtggc 

acaggtgctctgatgagacgaacccgaggaagcagccatacaagatgtcgaaccagaagctccaggacctcggactcgagtt 

caggccggtgagccagtccctgtacgagacggtgaagagcctccaggagaagggccaccttccggtgctcagcgagcaggca 

gaggcggacaaggaaaccctagctgccgagctgcaggcaggggttaccatccgagcatgaggaacaagaaatcaaccatgtc 

catactgctactgtcatgtaaaccagctgttgaatgcctaaaatctaagttcttgtaatactgtgttgtttcatgtggacta 

gattgatcg 

<212> Type : DNA 

<211> Length : 1217 5 

SequenceName : CCRlgenomicseq (SEQ ID NO: 18.) 

SequenceDescription : 

Custom Co don 
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TATTGTAGCACGACTTCTCACTGACATAGGAATGTAAACAGTCTCTCCACGCCATGTTTC 
-466 + 4- + + + + -407 



TTTCTAGTAGTAGCATACTAGTAGTAACTTCTCTTTGTCCTACACACACCCAGGGTCCAA 
-406 + + —4- + + + -347 



GAAAGG AAAAC GGC AC GACGGCACCC ACCGAC GACG ACGAC T CC AC ATC ACGGT T CGGT A 



AAAAAAGTCAAAACTCGCTGACGTGGCACCACCGGTCGCAGTCAACTGACGCGCTCCTCT 
-286 + + + + + + -227 



GCGCAGGTyTCACTtCAAGTTTCACCTACCACTGTGGGCCCACCGCCAaTGTGGGCCCCG 
-226 -~ + + + + + + -167 



CGAGCTtCTtACTCACTGACCTGTCTCCCACCAGCCTCCTCGCCGGTATATTACCCCGGC 
-166 + + + + + + -107 



CCCCAATTTCCTCTGCCTTCCCACGAGCAGCAGCCGGAGCACGGAATCCCGGCCGCCATT 
-106 + + + + + + -47 



CCTCCACCTTCAGCTCCGCCCAAAGATTTCCATCCGGCGAGATCCATGGGCTCCATCGCG 

-46 + + + + + + 13 

M G S I A 



GCGGACGCGCCTCCCGCGGAGCTGGTGTTCCGGTCCAAGCTCCCGGACATCGAGATCCCG 
ADAPPAELVFRSKLPDIEIP 



ACCCACCTGACGCTGCAGGACTACTGCTTCCAGCGCCTGCCGGAGCTCTCCGCGCGCGCC 

74 + + + + + +- — 133 

THLT LQDYCFQRLPELSARA 



TGCCTCATCGACGGCGCCACGGGCGCCGCGCTCACCTACGCCGACGTGGACGCCCTCACG 

134 + + + + + + 193 

CLIDGATGAALTYADVDALT 



CGCCGCTGCGCCGCGGGCCTCCGCCGCCTGGGGGTCCGCAAGGGCGACGTCGTCATGGCG 
RRCAAGLRRLGVRKGDVVMA 



CTGCTCCGCAACTGCCCCGAGTTCGCCTTCGTGTTCCTCGGCGCCGCCCGGCTCGGCGCC 

254 + + + + + + 313 

LLRNCPEFAFVFLGAARLGA 
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GCCACCACCACCGCCAACCCGTTCTACACGCCCCACGAGATCCACCGCCAGGCGACCGCC 

314 + + + + + ' + 373 

ATTTANPFYTPHE IHRQATA 



GCCGGGGCCAGGGTCATCGTCACCGAGGCCTGCGCCGTCGAGAAGGTGCGCGCCTTCGCC 

374 + + + + + + 433 

AGARVIVTEACAVEKVRAFA 



GCCGAGAGAG 

434 + 443 

A E R 
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T CGAC GCGG C C GCGT AAT AC GAC TC AC TAT AG GGC GAAGAAT TC GGAT C AT ATGGAT TCG 
-6735 + + + + + 4- -6676 

ACACTGGAATTTACTCCCATCGGGAGCGTGCAAACAAAAAGGTGTTATAGCAAGAAGACA 
-6675 4- + + + + 4- -6616 

CTGGCAACATTGCCAGCACAGAATTTGTTACAATCATAGAAAGTTTTATGACAGGACATT 
-6615 + + + + + + -6556 

GTTTCAACCGAAAGCAAGATTACAACAATATAATCAAGGGCTTGGGTCTGGTTGGACATG 
-6555 + + + + — + + -6496 

C T CGGTC C AAT GGACGAT TTAT TTGCC GAG AC C AGC T C AAGGAGT TGACGAGC AC ACTTA 
-6495 + + + + + + -6436 

AGCGCCGAGATCTTAAAGGCACCCAAGTCAACAAGTCGCCCATCTTGCTCTTTTGGCAGC 
-6435 + + + + + + -6376 

TCCTTGGACATCTCTTCGATATTGGCTTTGAAGCCATGACCCATCATAAGCTGAAAGGCT 
-6375 + + + + + + -6316 

AGGAGGGCACCATAGGTACGCGAAGTACGTTTGAATACCTCGAGGACCTCCCTCGTGTTG 
-6315 + + + + + + -6256 

ATGGCGAAAGCATCGATCAGCTGCCCCAAGGTCTTGTTTTGATCGATCTTGGGGAAGATC 
-6255 + + + + + + -6196 

ATCGAGTGCATCCGCGTCATGGATCCTTTACCCTTCTGAAGGAGGTCCTGAAAAAGCTGG 
-6195 + + + + + + -6136 

T GAG ACC CG AGGGT CAT T GAC AAAGC ATT CGC C GGAGAATT ATT CGGCAATTTAT C TAGA 
-6135 + + + + + + -6076 

GC C T C AGC AGGGAT GT AGGC AGC T TC TGGAG AAAGTGAAAGAGG AGGAGC TCAC T AACCA 
-6075 + + + + + + -6016 

AAAT CAAATC GAT AAAGC AAAAATC GGAAAGG AGGC CAAAAGGGGATTACTGAGCAAGGC 
-6015 + + + + + + -5956 

CAAGGAAGATTGGCGAAGGAGCTCATCTTTTTCAATCGCCCGAGCTTCGGCAGCAAGCCT 
-5955 + + + + + + -5896 

GGATGCCTCTTCATCCTTCAGCCTCTTTCTTAGCCCCTCGAGCTCATCCTTAAAGGAATC 
-5895 + + + + + + -5836 

AACCTCCTGGCGGGCCTCGGCAGCTATCTTTATCGCACCCTCCAGCTTCGAGGAAGAAGA 
-5835 + + + + + + -5776 

CTCGACCTCCTTTTGCAGCCGAGTCTTGTCAACTTCCAGAGAAGTGTATTGGGAGGCGAA 
-5775 + + + + + + -5716 

GGC C T CCAG AGAAG AGAT AAC AGCT C AC AAAT C CTT AAGAGATAAGGAAAAATAAT TAGA 
-5715 + + + + -+ + -5656 

CG AAGAAC T GGTTGTC AAC AAAC T T AT AATT TGAT C AGGGAAAT CGTC CC ACAT GGAT AT 
-5655 + + + + + + -5596 

AT C GT TAAAAC AGG AAAAGCTT AC AGGTT T C C CT GGAGGAGAAGCTGT AACC AC GGC AGT 
-5595 + + + + + + -5536 
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CAAAGAAATCTCCTTCCCTTTGGAAAGGGAAGAAGTTGTCGATATTTGAGCCATGGGGGC 
-5535 + + + + + + -5476 

TGCGGCAGGAGTCGAAGCCTCGGAAGCGGCTGGATTCGGCACGATGGCACCAGATTTGGC 
-5475 + + + + + + -5416 

CTTCTTGGCCGGAGGCTCGATGAAGCCATCTTCACTGCAAGAACAAAAAACTAGCGAAGT 
-5415 + + + + +— + -5356 

CAG AAT TC AAT GCAT AT GGCG AAG TT AGAAC AC AAT CCT GGAAAAGGAAGCAAGGACTT A 
-5355 + + + -+ + + -5296 

CAATTCATAGAGACCATCTTCATCGGCAAAGCCGCCGGATGATCTCTTTGGAGGTAGTGC 
-5295 + + + + + + -5236 

CTCGGCCTTTTCCGTAGCTGCATCAACAAAGGCAGCACGATCAGCATCGTCATCATGCAT 
-5235 + + + + + + -5176 

TGACCCCGCTGTATCGCTCATATCATCGGCAGAGAATCGAGGATTGATGGAAAAAGCCTC 
-5175 + + + + + 4- -5116 

AGGATT C AT CGGAT CATC AT GTT GAT C T AT CGGGC T TGC AT TC CCT AGAGTAT GGGACC C 
-5115 + + + + + + -5056 

TACAAGGACTAAGGAATCCCTTTTCTTGGAAAAATTGTTCGACAGGTCTTGCAAACGTTC 
-5055 + + + + + + -4996 

AAGAGC CGTAAGGATC T GTC GT AGTT GAC GAGTG AGAAT AATGGC AGTTAAAATAAT CAA 
-4995 + + + + + + -4936 

AGGAAC AT GAC AAT AAGAGC AT AAAGG GGAAATT TACCT CGGTT GGCAGAT GACC AGCGT 
-4935 + + + + + + -4876 

C AAAT GGCGGTTGAGGAGAT AT C AGT GGAAT TG AATC TTC CTGGCTAAAGAGGGTGAGAC 
-4875 + + + + + + -4816 

ACCGGACTTCGTCAAGCAGTTCTTTTTCGGATAATTCAGCAATATTTACTCTAGTCTCGT 
-4815 + + + + + + -4756 

C C CT GGGAC CC GAATAC AAC C AC ATC GG ATGGGT C CT AGACAT GAT CGGC TGAAC TC GAT 
-4755 + + + + + + -4696 

GT TT T AAG AAC AC AGCG GC T AC C TC AGTAC C TAT C ATGGTTTG ACCATCGGATT CT TT GA 
-4695 + : + + + + + -4636 

TCCGAAGGAATCTATCAAATAACTTGTCTACTGTTGGTTTTTCATCGGGTGAGAGGATAT 
-4635 + + + + + + -4576 

TTTTCCAAGACTTCTTGGGCTTGCTTCTAGAACATCGGAGAATTGGGGGGGAGCTGGGAG 
-4575 + + + + + + -4516 

TCGGCTGCTGATGAGTCCTTAATATAAAACCACTTCAGCCTCCAGCCTTGCACGGATTCT 
-4515 + + + + + + -4456 

TT C ATCGGGAAGT T GAAGTAGT T GAC TTCC T TAC G AGC AACAAAACCAACC CC ACC AAT G 
-4455 + + + + + + -4396 
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ACGAAG GACC C AC C AC TGCT GT TAT ATC T T TT C ACGAAGAAAATC T TCTT C C AC AAAC CA 
-4395 + + + + + + -4336 

AAGTGGGGCTCAATGCCCAAAAACGCTTCGCAGAGGGTGATAAAGATGGCAAGGTGAAGG 
-4335 + + + + + + -4276 

ATT GAGTT GGGGGT TAACT TC CAT AAT TGAAT CT CAT AC ACTC GAAGGAGGT GGT GAAGA 
-4275 + + -+ + + + -4216 

AAT T T GTGAGC GGGAAGC GAAAGACC T CGGTAC AAG AAGG ATAAG AACAT C AC AGTAAAA 
-4215 + + + + + + -4156 

CCGGCAGGAGGATTGGGCCGTGAAATTGCACCTGGAAGAATAACATTCCCCTCGTCAGAA 
-4155 + + + + + + -4096 

GAAATTATTACGAGGCTCCGGGCCCTCTTTTCATCTCGCTTCGTGGTGGTAGAAGCTGGC 
-4095 + + + + + + -403 6 

CAATCGCCAGGGATAGGCCCGGCCGTGGAGCTTGACGGCGCTGGCGGTGC'CGGAGCTGAG 
-4035 + + + + + + -3976 

GGAGGAGCATCTGGCGCGCTTCTCCGCGGCGGATTCGAAGGAGCCCTGACGGTGGTGCCA 
-3975 + + + + + + -3916 

CTGCTCACGGCGCTGGTGGCGAGAGTGGGATTCTTCTTCTTCACCATTGTGAGATTTGAG 
-3915 + + + + + + -3856 

GGAGATCTGGGAGTTGCGACGGTGGCGTGGTAGTTGCAAACGAAAAGGATGAATGAGGAA 
-3855 + + + + + + -3796 

GAAGGGACGC AAGG AT GAAGT GTGGAAAGGGG AGTT T ACCCC AAG AGATT AT AAAGTGAA 
-3795 + + + + + + -3736 

AGGAAAACCTGAGAATTGAGCGGGCACGTGTCGTTGCTCTCAATTTATTGAGGGGATTTT 
-3735 + + + + + + -3676 

TTCTCATCATAGATCGCGGAAATCGAGGAGTCACCTTGGTAACTGCACGCAAGTAGTGGT 
-3675 + + + + + + -3616 

CATTTCTTAAACAGAACCGCATAGAAGTAGGATGGGACCGTCAGGTCACGTCCTATCAGT 
-3615 + + + + + + -3556 

C AG AT T T ACAACAGT AATT AC AT CATC ACTGACGTC AAAGTATGC TTGAAG TAT C CGAAG 
-3555 + + + + + + -3496 

AAAAGT CGAAATT T GG GCT CG AAGAC TT T CT T GC AGAGAAGCGC GTGAAAGGAAT AT CTA 
-3495 + + + + + + -3436 

AGGAAAGGGTCAAAACATTCGGCTCGAGTCTACGCACGGATTGCAAGCATCCGTACCTAG 
-3435 + + + + + + -3376 

ACTCGGGGGCTACTCCCATCGGGAGCGCTGGACGTGCACCCGATAAATTTAGACGAGGAT 
-3375 + + + + + + -3316 

GAAAAC C GGAAAC CC AAGTGC T ACTC C C AT CGGGAGCGCCGAT TAC GC AC C C GAC AAACT 
-3315 + + + + + + -3256 

TTTTTGCACTCCAGGATCATGCCCGGGGACTTAATTCTGTGTAGAGTAGCGTTGTTTTGT 
-3255 + + + + + + -3196 
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CTTCGGCAGTTAACCAGCAAAGCTGGACACGTTACTCAATATCCTTTACGCAl'TAAACCC 
-3195 4- + + + + + -3136 

T T AC TT GAAGAAT T G AAGCC CC GATGC AAAT AT AT CGGATG AC CT ATGAAGGC C T GCGG A 
-3135 4- 4- + + + + -3076 

AAGC T TCGGGAGAAG AAG AC AT TC GAGTGGC ACAAC TTGAGT C T AC GAACGGAT T GC AAG 
-3075 + 4- 4 4 4 4 -3016 

CATCCGTACCTAGACTCGGGGGCTACTCCCATCGGGAGCGCTGGACTCGCACCCGATAGA 
-3015 4 4 4 4 4 4 -2956 

AGGAGATGATG AT AT T AC AAGAAGGAC AAGAAGTATC AAGGG AGAAGAAC ATT C GGTGG A 
-2955 4 + 4 + + 4 -2896 

GGCATGCTTTAGTCTCTACCCGAAAAAACTTCGGCTAGACACTCGGGGGGCTACTGACGT 
-2895 + + 4 + 4 + -2836 

GGGCATTACCCTTCGGGTAACTGATATTGCCCTATCCTGTACGACCCAACTGGAGGCCCA 
-2835 + 4 4 + 4 + -2776 

TGAAGACACTCGAAGGCAAGGTGGACCACTACGTCGGTGCCGAAGGGGGTTCCTTGAAGA 
-2775 4 4 + 4 4 + -2716 

ACAAGACGAAGAAAAGAAGAATACAAGAAAAGTATAGAACTAGGATCTTTTGTAACCTGG 
-2715 4 4 4 4 + + -2656 

TCGTACCCGGACAGATCTCTCGAGACCTGGCCCCCTACATATGGGCTAGGAGAGGGGCTG 
-2655 + + + + + + -2596 

C C GAGAGGGAC AC AC AC AAT C TT AGC AAT TTT AGC C AC C AT AAGT CC AG AGC AAGGT CC C 
-2595 + + + -+ + + -2536 

C GT AGAAC T TAGC C TC TCGACGAGAT C AC AGCC GAAACCT TC GGCAC CC C ATT G T AACC C 
-2535 + + + + + + -2476 

GAT AT TT TC AT AGT C AAGAT C AGAC AGGTAGGACGT AAGGGT TT TACCT C ATC G AGGGC C 
-2475 + + + + + + -2416 

CCGAACCTGGGTAAATCGCTCTCCCCGCTTGTTTGATAACCGATGGCTTGTGTCAGCTTA 
-2415 + + + + + + -2356 

CAT G AT TC C ATC T AC C CT AAAC CTC AAAC GGAGGGC ATTGC C GAGGAGT AC C C T C GACAT 
-2355 + + + + + + -2296 

T C C C C TC C AC C AATG GT CTC AC ATAAATT CAAC AAAGC AAAC T C AT AAAAAGT T T AATG A 
-2295 + + + + + + -2236 

GT TT C AGAAAGAAAT AAAAC T AGGCC CC TC CTT TGAGAATC T AC GAATGATTC AC CAT AT 
-2235 + + + + + + -2176 

CATCTCGCAGTTAGTGATGAGTAACTAAGTCTCAAATTTCCCGACGCATGGCGAAAAAGG 
-2175 + + + + + + -2116 

T AGC G AAC T T AAAATG TGAGGAATGAATGCC ACAT AT GCAT GGT GC ATC GAGT AT TCTC A 
-2115 + + + + + + -2056 

T T TT AGTC TT GG AT TAC TCC C T TT AGAT GTT GAC AC CAT CCC AAAAATAC AAC T TGGAC A 
-2055 + + + + + + -1996 
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AGTTGTTCATTTCACTAGTATGAATTTCAGTAAATCGGGCAATACTCCAACAGTCATTCA 
-1995 + + + + + + -1936 

CCCCCTAGGCGAGGTTAGCTCAGATCAACGTCGGGTGTCTTCATCGAGTTAATGTCGTCA 
-1935 + + + + + + -1876 

C ACGC AC AC AC ACGT AC GC GC AC AC AC ACGT GC GC AAAC AAAAAGAAAAC TAGGAACCT T 
-1875 + -+ + + + + -1816 

CTCACGTAGCCTAGGTCTTGTCCTGTAAGAAAAAACCCAGGTCCACCCTAGTTTCGAACC 
-1815 + + + + + + -1756 

AAAATAT TT T TG AAGAT AC AT TAGTAAG ATATT TT TGAAAAT AAAAC CG C AAAAAGGGAA 
-1755 + + + + + + -1696 

TTGAAAAATATGGACTGGCTGTTTTGTCCAAAACCACATCTTTCGGAGAACCACGAGGGT 
-1695 + + + + + + -1636 

ATCTATTGATGGGCTCATACTATACCTGGGCATGTGTTGGGCCAGGCCTCATGTCGGGCC 
-1635 + + + 4- 4- + -1576 

GAGGAAAGCCCGACGCTGAAAAATCAGGCCCAAGCTTAACCCGGCCCGACCAAATACCCA 
-1575 + + + — + + + -1516 

CCAAACCCGTTGGGCCATCAGGTTGCGGGCCGGGCAGTAGTGTAAAACACCGATTTCGGG 
-1515 + + + — + + + -1456 

CTACATAGGCCCGGCTCGTTTGTCGGGCAAACATTTCTAGACCTAAGCCCGAGTTTTTCG 
-1455 + + + + + + -1396 

GGCCGGGCTGCCCATGGCCAGGTATAGCTCATAACGACGTATGACATTTCGAGCAATTGA 
-1395 + + + + + + -1336 

TGCAAAGCACGTGTAGGGTTTTATCCCATCCGTGTGGCGTGTGTAGGGTGTAAATGAATA 
-1335 + + + + 4- + -1276 

GGATAATTTCCTCGCCGAAACTGGTCCCAAATTCGCTTTGAAGTGTCCATATATGATTTT 
-1275 + + + + + + -1216 

AAAG AAT GT GAC AAAT AAAG AT ATC C AATT TCG AAAT AGT GC TCCGGATACGGT AT AGGA 
-1215 + + + + + + -1156 

TAT GG T AT AGC AAAT AAC AT GC TGAT ATGGAT T GT CC GATAT TAAATT AAGAT AATC C AA 
-1155 + + + + + + -1096 

ATGTTTTAAACCGCATAATTCGATTTTTGAGTCAAAAGCGAATGCCAATTCAGAAGGTTA 
-1095 + + + + + + -1036 

GCAGTTATTGAGTTTCAAAATTTATTTGGCGAGCATATCTAGTTCTAAATTCTATCACGT 
-1035 + + + + + + -976 

AAATTGTGTCTTTTTTTAATAACTACACAAGACTAAAAGTTTAAATCTCTCTCAAGATTT 
-975 + + + + + + -916 

GCG AAAAC T AT AGC T ATC T ACTGATAT ATAT ATCC GAC T ATATT TGTT T TC GG AC CGC AT 
-915 + + + + + + -856 

GCGTCCTATTTCCGATTCGAATCTGCACTCCGATATATCCACATTGAATCTAAAACCGAT 
-855 + + + + + + -796 
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CAATATTTGCTCCGATCTAAATCCGGAAAAATATGTGGTGAAGGATATGGTATAAGCAAA 
-795 + + + + + + -736 

ATCCGATTTGATCCATTTGTACCTCTAGGCGTGTGCAAGACCTGGAGGAAAGAATGGCGC 
-735 + + + + + + -676 

ATCTGTAGGGTGCAGTCCCACCGGTGGAAAATGTGAGCTCACCGTATTGTCCCCCGATGG 
-675 + + + + + + -616 

AG CAT C GAAACGGAGT CGGAAC AC GAT TTGCGC C ACGT AC AGAG C ATGC ATGAT T T CCCT 
-615 + + + + + + -556 

TGTATGCGGTCCAGGATCTTAAACTGCCTTCCATTTCCAGGAACCTACCGATTGGCTGCA 
-555 + + + + + + -496 

AGCCGTAGCTAGCGGTTTGAAGTCACGGCATTGCCGCCCCCGATTAACCCACCCGTCGCG 
-495 + + + + + + -436 

CGCGCGGTCGGTCGTTTCACCGTCCTGCCTAGGCTACGCACGCGCGCGCGCAGTTGGGCC 
-435 + + + + + + -376 

AGTTGTAGGTAAGCCGACTCGAGATCACACACCCGGCCTCACCTACTACCTCTCGCCGTC 
-375 + + + + -+ + -316 

GCGGTCACCGTGTCACACTCACGCCCAGGGGAGCCACCCGCCCACACGGCGCCTAGCTCA 
-315 + + + + + + -256 

TCCCCTCTCACTACTCTTCTTCTCCTCCCTCTCACCTCGCCGTCGACCCAGCTCCCGGCT 
-255 + + + + + + -196 

CTATAAATTCCGCACTACTCGAACCAACATCGCCCAGGCCTTTGCCTTTTACGACGAATC 
-195 -+ + + + + + -136 

CTACCAAACCGAGCTACCAGATCCTTCTCTACTAATCGAGCTCCCTACGCTGCTCCGCCT 
-135 — + + + + + + -76 

GTCTTCGTTTCCGCCTCACCGCCGGCCGGTTCTCCGCTCCAAGCTACGTCCGTCCGTCCA 
-75 + + + + + + -16 

CATATATAGCATCGACATGACCATCGCCGAGGTCGTGGCTGCCGGAGACACCGCCGCCGC 

-15 + + + + 4- + 44 

MT IAEVVAAGDTAAA 

GGTGGTGCAGCCCGCCGGGAACGGGCAGACCGTGTGCGTGACCGGCGCCGCCGGGTACAT 

45 + + + + + + 104 

VVQPAGNGQTVCVTGAAGYI 

CGCGTCGTGGCTCGTCAAGCTGCTGCTGGAGAAGGGGTACACCGTCAAGGGCACCGTCAG 

105 + + + + + + 164 

ASWLVKLLLEKGYTVKGTVR 

GAACCCAGGCATGTCACCCATGCATTCATCATTTTCTTACTAGTCGTATGCGTTATGCGA 

165 + + + + + + 224 

N P G 

C TT GT GT AT TAAC T ATT GTGG AC TGC ATGC AG ACGAC CC GAAG AACGC GC ACCT GAGGGC 

225 + + + + + + 284 

DPKNAHLRA 
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GCTCGACGGCGCCGCCGACCGGCTGGTCCTCTGCAAGGCCGACCTCCTCGACTACGACGC 

285 + 4 4 4 4 4 344 

LDGAADRLVLCKADLLDYDA 

CATCCGCCGCGCCATCGACGGCTGCCACGGCGTCTTCCACACCGCGTCCCCCGTCACCGA 

345 + 4 4 + + 4 404 

IRRAIDGCHGVFHTASPVTD 

CGACCCCGTACGTACTCCATAGAACTCGGCACCCCTAGCTTCTCTCCGTTCTCTCTGTAT 

405 + + + + + 4 464 

D P 

GT CT GT C AC C GT C GAT C GC CAT GG C AG C AC G C AT GC AT GC GC GC GC AAC GC T AG C T AGAC 
465 + + + + + + 524 

GCTGACCGACTCATTGTGCAGGAGCAAATGGTGGAGCCGGCGGTGAGGGGCACGCAGTAC 

525 4 4 4 + + + 584 

EQMVEPAVRGTQY 

GTCATAGACGCGGCGGCGGAGGCCGGCACGGTGCGGCGGATGGTGCTCACCTCCTCCATC 

585 4 + 4 4 + 4 644 

VI DAAAEAG TVRRMVLT SS I 

GGCGCCGTCACCATGGACCCCAACCGCGGGCCGGACGTGGTCGTCGACGAGTCGTGCTGG 

645 + + + + + 4 704 

GAVTMDPNRGPDVVVDESCW 

AGCGACCTCGACTTCTGCAAGAAAACCAGGGTGGGTGCTGCATGCTCAATTTTTATTATC 

705 4- 4 4 4 + + 764 

SDLDFCKKTR 

ATAGCTACCCTTTTTCTGCACCATGCTGCATTTCTTTTCCAAAAACAACTCTCAAAAGAT 
765 + 4 4 4 + 4 824 

AT GC T AC GTGGT GAGT T C C TAT AG C T G AAT TAT T AC AAC T AC C AC C CT AT CGAT C AC TAC 
825 4 4 + 4 4 + 884 

CGCCCTAAAAGTGTTCAACTTTTGAAGGCAACCAAAACCAATACATGAACGACGATCGTG 
885 + + + + + + 944 

TGCGCTTGTCGTCGTTATCATTAGCCTCTGTAGCTCTAATTTTCACCTATGTACGCATGG 
945 + + + + + + 1004 

AT AGAC GAT T C GGAAAT AC AGT T C AG TT T AC C TAC CAT AT AC TATGC C GAAAT C GAACGC 
1005 + + + 4- + 4- 1064 

ACACAGGTGTGAGGCAGCAGCCGCTCACGAGTTATGCGCCGAAACCGACATCTCGGAATC 
1065 + + + + + 4- 1124 

TTCAGTCCACAATCAAAAAATAGACACCTGGTACCACTACAAAATTATACTCCTACTGTA 
1125 + + + + + 4- 1184 

TATTGGTAAAACAAAACATTTTCTTTTTTATTTGATAGGAGTGCTGCAAATTAAAGTTCT 
1185 4 4 4 4 4 4 1244 

TTGTGTCATTTTTCAAAGGAAAAAAAAAACACCTTTACCACTCTTCTTCCTTGCCATCAT 
1245 4- 4 4 4 4 4 1304 
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TTTTTTTTTACCAAAGTTTGTTCTGTCAAATGAACATATATATAGTTCGGTGCTATGTCA 
1305 + + + + + + 1364 



GTGCCATTTACCGGCCACTAGCTAGTAGGACTGCCATGTTCCAGCAAATTGTCTAGTGGA 
1365 + + + 4- + 4- 1424 

CCGGAGTGGCCAAAAGGAGCCAATTATGTAGGGTTGCAAGCGGGATCACACAAAAGCCTC 
1425 + + + + + + 1484 

GCCTCTAGTTCATTTTATCAATTAAGTGGTACTTTCTCAGGGACCCCCCTTGCAACTCTA 
1485 + + + + + + 1544 

C CAT TAC AT C C GT GC AAAAT AAAAG C T AGC AT C AC GC AC C AG ATT TAGT AC TCCCTCCGT 
1545 + + + -+ + + 1604 

TTTTATTTAGTTCGCATTCTAGGTTCAGCCAAAGTCATACTTTGCAAAGTTTAACCAAAA 
1605 + + + + + + 1664 

T T AT AAG AAAAAAAT AT C AAT AAT C AT C AT AC AAAATAC AT AT AAT AT AAG AGT AAAC C T 
1665 4- + + + + + 1724 

T AT AAC GAT T C T AC AAT AGATT TTT T T ATT GC AT ATGT C AAT AT TTTT T C AT AAATATT T 
1725 + + + + + + 1784 

AC T C AAAATT AT AAGG TT T GAC T T TGACT AAAC C C AGAAC C TT C TT AGAGAGGAAGAAAT 
1785 + + + + + + 1844 

G C ATGGGC AAAAGC AAAT CAT G C AT ATGGGC AGGAGTAACAT T TT TTT GAC TTT C AT AGA 
1845 + + + + + + 1904 

AAGT AC TGTAT GGC ACT AAAC GGTCT AAAC CGGAC AC TGGAAGC AAAT CGTGC AC GTGGG 
1905 + + + + + + 1964 

CAATATTATCTACCGTCGCGTCGCCAGTCTCCCCATGCCCATGACCATGCTTGGAATTTT 
1965 + + + + + + 2024 

AGT C T CGC C GGAGC T GC C GAGTGC ATGC AT AGT GAC GAGTT T C AATAGGC C AC TAT AT AT 
2025 + + + + + + 2084 

GTGATCATGGCTCTTGATTTGTCACTTTCTTTTTTTGCCGAAGGATATAGTAGTATTACT 
2085 + + + + + + 2144 

TT C T C T GC TAT C AC AAAG AAAG AAC T GAT TGT GT C TAGT C T AGGTGGT C T C AG AATT CT G 
2145 + + + + + + 2204 

CATGACTCCAGAGTATTCTTGATGCCACTTGTTTGTTATTGCAAGAAACTTAATTCGGAG 
2205 + + + + + + 2264 

AC AAC CAAAAGCT C ATC C C ATGTCT CT GGAACT AGTAGAC AT AAGAAAATC T C ATGGT AT 
2265 + + + + + + 2324 

CAGTTTGCTATTTATCTACAACTGAAACGGCATGTTTGGTTTTATTAAATTCAGAACTGG 

2325 + + + + + + 2384 

N W 

TACTGCTACGGGAAGGCGGTTGCGGAGCAGGCGGCATCGGAGTTGGCGCGGCAGCGCX3GC 

2385 + + + + + + 2444 

YCYGKAVAEQAASELARQRG 
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GTGGACCTTGTGGTGGTGAACCCGGTGCTGGTGATCGGCCCCCTGCTGCAGCCGACGGTG 



2445 + + + + + + 2504 

VDLVVVNPVLVIGPLLQPTV 

AACGCCAGCATCGGCCACATCCTCAAGTACCTGGACGGGTCGGCCAGCAAGTTCGCCAAC 

2505 + + + 4- + + 2564 

NASIGHILKYLDGSASKFAN 

GCCGTGCAGGCGTACGTGGACGTCCGCGACGTGGCCGACGCCCACCTCCGCGTCTTCGAG 

2565 + + + + +- + 2624 

AVQAYVDVRD VADAHLRVFE 

TGCGCCGCCGCGTCCGGCCGCCACCTCTGCGCCGAGCGCGTCCTCCACCGCGAGGACGTC 

2625 + + + + + + 2684 

CAAASGRHLCAERVLHREDV 

GTGCGCATCCTCGCCAAGCTCTTCCCCGAGTACCCCGTCCCCACCAGGTACGCGTACGAC 

2685 + + + + + + 2744 

VRILAKLF PEYPVPTR 

CTGCTTGCTAGCCGCTTCCGTTAATTCCATTGCCTTAATTGATTGCATGATGCCGCTCCT 
2745 + 4- + + + + 2804 

AATTTACTCACTTGCGTAACTAATTGCATTCATATATGATCTACCAACCGTGGAGAAAAT 
2805 + + + + + + 2864 

TAGCAAGAGTCTGTCGGGGCGTCCCGGTCCAGTGCAGTTAACCTGCATGTCGATGGTCTG 
2865 + + + + + + 2924 

CAGGT TGC AGCT TAC TTGT GG T TC TT TAGTTC AGAGACAC AGAGC AATT GGGCACT AAGC 
2925 + + + + + + 2984 

AAAACTGAC ATCAC T GGTAAT T AGGT AGC TCC C AC AC ACTGAAGTGGGTGGAT C CC ATC G 
2985 + + + + + + 3044 

GTAGTAGGTAAGGGTGGATAGTACTGGACGAGAGCTCGATCGTTGTTGTAAAAAAGCGAG 
3045 + + + + + + 3104 

TGACCACCACTTCACCATCCACTGCAAGTAGCTGCTAGTGAACCATCCAACCAGCTCCCT 
3105 + + + + + + 3164 

GGATC ACT CT GCTCCGTCCGT AC CT T C AGCT AC C T ACAGAAGCGAC ATG AAC AC AC AGAC 
3165 + + + + + + 3224 

ACACAAGGCCGGCTCACCATTCGCATAGGTCAAACCAAATGTTGGTGAACGGCAACATCG 
3225 + + + + + + 3284 

CCACAAGTCGCGTGCTAGTTCGAGGTTGTGTCCGGTGTACCGAGGCCACACTATTCGTGC 
3285 + + + + + + 3344 

TGCCCGTCG C TGAT ATT TG C AC G C GT AG CT GT C GACGAAAGT AGGT GGACT GAC AG AT AC 
3345 + + + + + + 3404 

ACATATCCTC ATTG CC TT CT C TG CT CGGTTTCTGCTAGGATTGC CATC TTCAGGAGTGC C 
3405 — + + + + + + 3464 

TAT C C GC AC GG CAG AAAC G C GT AG C ATC AGG C C AGAAAGC AGC GTG CGT GAT AT C GTAAC 
3465 + + + + + + 3524 
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CCAGACGGTCTTCACCTGTCCATTCTGGGCTACCTGGCATACTACCTCGGTGCCGCTGTG 
3525 + + + + + + 3584 

CCGCTGACCAATTCGTGCACGACCACTATAGCAAAACCCTATGCATGTAACTGCTTCAAG 
3585 + + + + + + 3644 

ATCAGCAGTGACATGTGCAATATAAACCTCAAGTGTGCACTCTAGTGCGTACTGATAAAA 
3645 + + + + + +-' 3704 

CCGTATAACTGGTGACCCAGTCATTCTTCTCTTTTTTATTTGTTTGGACCAAACGAACAC 
3705 + + + + + + 3764 

AGCATGTTATCCATCACCAACAAGTGGCGCTGATTTTTCAAACTACACTGGGATCATACT 
3765 + + + + + + 3824 

GGAAAC C AAAG C AGGAGAAC AT C TT C GAAC C AAGAGAT GTTT AC TAAATTT GAAAGAAAA 
3825 + + + + + + 3884 

TGTACTGACAAGTAATCTGTCTGAAGCAAGACACATACTACCTCGGTTCGAACGTGGGAC 
3885 + + + + + + 3944 

ACCATGCCCGTGCCATATTTGCTAGGCACCACTCTGCCGTCGATTGTATCCCAACGGAGG 
3945 + + + + + + 4004 

GAGT ATCGATTT GC G C AAAGT T C C T AC ATAC AT AGC C GC T C AAGAT ATAAT CT TACG AC C 
4005 + + + + +--~ + 4064 

TTCCGTCGAAATCGGTGATACGTCGCAACCTATAGCTAACTTGGCAGAGCATAAAATAAC 
4065 4- + + + + + ' 4124 

TATCTAAGGTTGGGGTCTCCCTCTTTTCAATCAACCTTTCATACCGAATGATGGGAGTGT 
4125 + + + + + 4184 

TTGTGAAAACATCTCTTGGTCGACTCAGCATTAGCGCCCTACCAATTTCTCTGTGGACAA 
4185 + + + + + + 4244 

TGCCACCTTAAATCGTTTTTTAGTCTTCATGATTTACTCCCCCTTATATCTGGCCGTAGT 
4245 + + + + + + 4304 

CCCTCTTTTCCATTTTTCTTGTCTGGTTTTAAGTCAAATTTAGACTACTAAAACAACAGC 
4305 + + + + + + 4364 

AAGAT TTTATGGAAGGGAGGTAGTGCAAAAC AG AAAGTCCGATCGAAATGCGTGCCAATT 
4365 + + + + + + 4424 

TGTCGTCGCGGCGGCCGGACTAAAATGGATCTGCATGTGCATACCGTTCGTCGGAGTATC 
4425 + + + +- + + 4484 

CTGCGAACGGTCGTGTGTTTAGTCAACATTAATGTGAGGTTCATGTGATACTCTTGCTTG 
4485 + + + + + + 4544 

AAAGATACTACTACTGCTACCTCGTAGAACTGAATGAAAGTATGTGGGACTGTTCAGCTC 
4545 + + + +- + + 4604 

TCTGCACATGTCAAATGTCGTTACTCATACCTTTCGTCAGAGCATCCTGCGACGCGCGCC 
4605 + + + + + + 4664 

GGTGCCGAAATTTCGCCGTGTGTTTAGTCAAGATCAACGTGAGGTTCATGCGGTACCCTA 
4665 + + + + + + 4724 
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TC TGG C TTC GAAG AT AC C AAG C AGAC T GCGGC T AGATTGT C ATT T T GATGT C GC AAT C TT 
4725 + + + + + + 4784 

CACCAAACCTGCCCTTCCGGACCACAGCAGCAGTACGTAACAATGGTGTCATCGCCATGC 
4785 + + + + + + 4844 

GTTGCTCGTGTCCAAGGAAACGGAGGAATCTCGGCTTCCCACAAGTCACGCATCGATGTT 
4845 + + + + + + 4904 

C AC AC C T G AAT T GGT C G AC GTTTCTTC TT CTAG AC TAGAAAAAGAT TAC AG AAC AAC GC A 
4905 + + + + + + 4964 

AGCTTCGTTCAAGTCCATACTTCTGTTCAGTATACTCCTGATGATTGCAGTTATATCAGC 
4965 + + + . + + + 5024 

ATGTCTATTCTGAATTTTTGCACTTCTATTCAAAGGATGGGCTGGAATTGCTACTGACTT 
5025 + + + + + + 5084 

TGGTGTGATGTGTGTGGC ACAGGTGCT CTGATGAGACGAAC CCGAGGAAGCAGCC ATACA 

5085 + + + + + + 5144 

CSDETNPRKQPYK 

AGAT GT C GAAC C AGAAGC T C C AGGAC C T CGGAC T CGAGT T C AGGC C GGT GAGC C AGT CC C 

5X45 + + + + + + 5204 

MSNQKL QDLGLEFRPVSQSL 

TGTAC GAGAC GGT G AAGAG C C TC C AGG AGAAGGGC C AC CTTCCGGTGCT C AGC GAGC AGG 

5205 + + + + + + 5264 

YETVKS LQEKGHLPVLSEQA 

CAGAGGCGGACAAGGAAACCCTAGCTGCCGAGCTGCAGGCAGGGGTTACCATCCGAGCAT 

5265 + + + + + + 5324 

EADKETLAAELQAGVTIRA* 

G AGGAAC AAGAAAT C AAC C AT GT CC ATAC TGC T ACTGT C ATGT AAAC C AG C TGT TGAATG 
5325 + + + + + + 5384 

CCTAAAATCTAAGTTCTTGTAATACTGTGTTGTTTCATGTGGACTAGATTGATCG 
5385 + + + + + 5439 
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PostalCode : 3000 
PhoneNumber : 
FaxNumber : 
Emai 1 Addr es s : 

<110> OrganizationName : Dairy Research, and Development Corporation 
Application Project 



<120> Title : Modification of Lignin Biosynthesis 
<130> AppFileReference : 40494788 
<140> CurrentAppNumber : AU PQ8154 
<141> CurrentFilingDate : 2001-06-14 

Earlier Applications 



<150> PriorAppNumber : AU PQ8154 
<151> PriorFilingDate : 2000-06-14 

Sequence 



<213> OrganismName : Lolium per enne 
<400> PreSequenceString : 

cggcacgagt ggactttccg acgccggagt cgccgatgat gaccgccttg aggaggtagt 60cgtagtcgt 
c ctccgccctg tacgcgccgc tgcccgccat ttccttcctc gcctcgcggg 120tcctcctccc cgacctg 

cgc taggctctgg atctcgcggg gtttgggcgc ggcgtcctcg 180ctgtgagctc gtgccgaatt cggca 

cgagc caccttcgag gcgtgcactg gtacgagctc 240gcgagccatt gtcagtgcag tgtaggctct get 

actegtt ggccattcca agaagctctc 300tgctccctga aaccagagga tcatgatcac ggtggcggcg c 

ccgaggtgc agcagccgca 3 60gatcgcggcg getgetgegg ccgtggaggc ggcggcaccg gaggegaega 

cgatcttccg 420gtccaggctc ccggacatcg acatcccgac ccacatgccc ctgcacgact attgette 

gc 480gacggcagcc tcggccccgg acgcgccgtg cctcatcacc gcggccacgg ggaagaccta 54 

Ocacgttcgcc gagacgcacc tgctgtgccg caaggccgcg gcggcgctgc aegggctegg 600cgtgcgcc 
ac ggggaccgga teatgetget gctccagaac tccgtggagt tcgcgctcgc 660cttcttcggc gegtec 

atgc tcggcgccgt cagcacggcg gcgaacccgt tctgcacgcc 720gcaggagatc cacaagcagc tegt 

ggcetc eggegegaag ctggtcgtca cgcagtccgc 7 80ctacgtcgac aagctccggc aegaggcett cc 

cccgaatc ggcgaggccc teacegtgat 840ca 

ccatcgac gaggacgacg gcaccccgga cggctgccag ccgttctggg ccctcgtgtc 900agccgccgac 
gagaacagcg tcccggagtc tcccatctcg ccggacgacg cggtggcgct 960gccctactcg tegggcacg 

a cggggctgcc caagggegtg gtgetgaege aegggggget 1020ggtgtcgagc gtggcgcagc aggtgga 
egg cgagaacccg aacctgcaca tgcgggcggg 1080ggaggacgtg gtgctctgcg tgctgccgct cttcc 
acatc ttctcgctca acteggtget 1140gctgtgcgcg ctgegggegg gcgccgccgt gatgetgatg cct 
aggttcg agatgggggc 1200catgctggag ggcatcgagc ggtggcgcgt cacggtggcg gccgtggtgc c 
gccgctggt 12 60gctcgcgctc gccaagaacc ccggggtgga gaagcacgac ctcagctcca ttcggatcgt 
1320gctctccggc gccgcgccgc teggcaagga gctcgaggac gcgctacgtg gccgcctgcc 13 80g 
caggccatc tteggacagg getaegggat gaeggaggee gggccggtgc tgtccatgtg 1440cccggcgttc 
gegegggage cgacgccggc caagtccggc tegtgeggea ccgtggtgcg ISOOcaacgcccag ctcaaggt 
gg tcgaccccga caccggcgtc tccctcggcc gcaacctccc 1560cggcgagatc tgcatccgcg gcccgc 
agat catgaaagga tacttgaatg atcccgtggc 1620cacggccgcg accatcgacg tcgaggggtg gctc 
cacacc ggegacateg getaegtega 1680cgac 

gacgac gaggtcttca tcgtcgaccg cgtcaaggag ctcatcaagt tcaagggctt 1740ccaggtaccg cc 
ggecgage tcgaggctct gctcatcgcg catccgtcca tcgccgacgc 1800ggccgtcgtc cegcaaaagg 
atgatgeege eggegaggtc ccggttgcct tcgtggtccg 18 60cgccgccgac tccgacatcg ccgaggagg 
c catcaaggag ttegtatcca agcaggtggt 1920gttctacaag aggctgeaca aggtctactt cacccac 
gcg atacccaagt eggegteggg 1980gaagatactc aggaaagaac tcagagctaa actcgccgcc cegge 
cactg cctgaagagt 2040ggttcatggc ttcatgctaa tcatttcgat cagaaaggca cttctagcat ata 
tgttcca 2100ccttttgttt catttggaag attgtattcc agctagtggc cagtgactga gtaagggatg 

■ 2160gggataaaag ttttgtctac gttttctttt acgctactct ctccattggg gagtacaatg 2220tat 
cagggga ttcgtgattg aagttaatca agattggttc aattataaaa aaaaaaaaaa 2280aaaa 

2284 

<212> Type : DNA 
<211> Length : 2284 

SequenceName : 4CLlcDNA (SEQ ID NO : 1) 

SequenceDescription : 

Custom Codon 



Sequence Name : 4CLlcDNA (SEQ ID NO: 1) 
Sequence 
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<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MI TVAAPEVQ QPQIAAAAAA VEAAAPEATT IFRSRLPDID IPTHMPLHDY CFATAASAPD 6 OAPCL ITAAT 

G KTYTFAETHL LCRKAAAALH GLGVRHGDRI MLLLQNSVEF ALAFFGASML 12 OGAVSTAANPF CTPQEIH 

KQL VASGAKLWT QSAYVDKLRH EAFPRIGEAL TVITIDEDDG 18 OTPDGCQPFWA LVSAADENSV PESPI 

SPDDA VALPYSSGTT GLPKGWLTH GGLVSSVAQQ 2 4 0 VDGENPNIiHM RAGEDWLCV LPLFHIFSLN SVL 

LCALRAG AAVMLMPRFE MGAMLEGIER 3 0 OWRVTVAAWP PLVLALAKNP GVEKHDL SSI RIVLSGAAPL G 

KELEDALRG RLPQAI FGQG 3 60YGMTEAGPVL SMC PAFAREP TPAKSGSCGT WRNAQLKW DPDTGVSLGR 

NLPGEICIRG 420 PQ IMKGYLND PVATAATIDV EGWLHTGDIG YVDDDDE VF I VDRVKELIKF KGFQVPPA 

EL 480EALLIAHPSI ADAAWPQKD DAAGEVPVAF WRAADSDIA EEAIKEFVSK QWFYKRLHK 54 

OVYFTHAI PKS ASGKILRKEL RAKLAAPATA RWHGFML I I SIRKALLAYM FHLLFHLEDC 600IPASGQ 

606 

<212> Type : PRT 
<211> Length : 606 

SequenceName : 4CLlpep (SEQ ID NO : 2) 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

cggcacgagc gccattcctc caccttcagc tccggccaaa gatttccatc cggcgagatc 60catgggctc 
c atcgcggcgg acgcgcctcc cgcggagctg gtgttccggt ccaagctccc 120ggacatcgag atcccga 

ccc acctgacgct gcaggactac tgcttccagc gcctgccgga 180gctctccgcg cgcgcctgcc tcatc 

gacgg cgccacgggc gccgcgctca cctacggcga 240ggtggacgcc ctgtcccgcc gctgcgccgc ggg 

gctgcgc cgcctcggcg tcggcaaggg 3 00cgacgtcgtc atggcgctcc tccgcaactg ccccgagttc g 

ccttcgtgt tcctcggcgc 3 60ggcccggctc ggcgccgcca ccaccaccgc caacccgttc tacacgcccc 

acgagatcca 420ccgccaggcc accgccgccg gggccagggt catcgtcacc gaggcctgcg ccgtcgag 

aa 480ggtgcgcgcc ttcgccgccg agagagggat tcccgtcgtc tccgtcgacg agggcgtcga 54 

Ocggcggctgc ctcccgttcg ccgagactct gctcggggaa gaaagcgggg agcggttcgt 600cgacgagg 
eg gtcgaccccg acgacgtggt ggcgctgccg tactcgtccg gcaccaccgg 660cctgcccaag ggcgtc 

atgc tcacccaccg cagcctcgtc accagcgtcg cccagcaggt 720ggacggtgag aacccgaacc tgea 

cttcag ctcgtcggac gtgctgctgt gcgtgctgcc 7 80gctgttccac atctactcgc teaacteggt gc 

tgctcgcc ggtctccgcg ccgggtgcgc 840ga 

tegtgate atgcgcaagt tcgaccacgg cgcgctggtg gacctggtgc gcacgcacgg 900cgtcaccgtg 
gcgccattcg tgccgcccat cgtggtggag ategecaaga gcgcgcgggt 960gaccgccgcg gacctggcg 

t ccatccggct ggtcatgtcg ggggeggege ccatgggcaa 1020ggagctgcag gaegegttea tggccaa 
gat ccccaacgcc gtgctcggcc agggatatgg 1080gatgaccgag gccggccctg tgctggcgat gtgee 
tggee ttcgecaagg agccgttcgc 1140ggtcaagtcc ggttcctgcg gcaccgtcgt caggaacgcc gag 
ctcaaga tcgtcgaccc 1200cgacaccggc gcctccctcg gccgcaacct geegggggag atctgcatcc g 
eggcaagea 1260gatcatgaaa ggttacctaa atgatccggt ggccacaaag aacaccattg acaaggaegg 
1320ttggctgcat actggtgaca ttggttatgt cgatgatgac gacgagatct ttattgtcga 13 8 0c 
agactgaag gagataatta aatataaggg attccaagta cctccggcgg aacttgaagc 1440ccttctcatt 
acacaccctg aaatcaagga tgctgctgtc gtatcgatgc aagacgaact ISOOtgctggtgaa gttccggt 
tg cgtttgttgt geggactgag ggttcagaga teagegaaaa 15 60cgagatcaag cagttcgttg caaaag 
aggt tgttttctac aagaggatct gcaaagtgtt 1620cttcgcggat tccattccaa agagtccatc tggc 
aagatc ctcaggaagg acctgagagc 1680aaag 

ctcgcc gcaggcattc ccagcagtaa taccacacag tccaaaagct aagtcagata 1740tattgtttcc ca 
accttaca cacctctgtc caacaccatg taatgttctt aatataaacg 1800gaaattatta catatagaag 
ggctgattct ttttactaga tgtgtccaac atatgatatg 18 60cttgttaggc cgatgatgtg taacctgtc 
a tgtatagata ccgccttttt ttgacaagaa 1920aggctgatta taatgtatac cgtgaactga atatttg 
ttc agggagatca aaaaaaaaaa 1980aaaaaaaaaa aa 

1992 

<212> Type : DNA 
<211> Length : 1992 

SequenceName : 4CL2cDNA (SEQ ID NO : 3) 

SequenceDescription : 

Custom Codon 



Sequence Name : 4CL2cDNA (SEQ ID NO : 3) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MG S I AADAP P AELVFRSKLP DIEIPTHLTL QDYCFQRLPE LSARACLIDG ATGAALTYGE 6 0VDALSRRCA 
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A GLRRLGVGKG DWMALLRMC PEFAFVFLGA ARLGAATTTA NPFYTPHEIH 120 RQ ATAAG ARV IVTEACA 

VEK VRAFAAERGI PWSVDEGVD GGCLPFAETL LGEESGERFV 1 8 ODEAVDPDDW ALPYSSGTTG LPKGV 

MLTHR SLVTSVAQQV DGENPNLHFS SSDVLLCVLP 240LFHIYSLNSV LLAGLRAGCA IVIMRKFDHG ALV 

DLVRTHG VTVAPFVPPI WE I AKSARV 3 00 TAADLAS I RL VMSGAAPMGK ELQDAFMAKI PNAVLGQGYG M 

TEAGPVLAM CLAFAKEPFA 3 60VKSGSCGTW RNAELKIVDP DTGASLGRNL PGEICIRGKQ IMKGYLNDPV 

ATKNTIDKDG 420WLHTGDIGYV DDDDEIFIVD RLKEIIKYKG FQVPPAELEA LLITHPEIKD AAWSMQD 

EL 4 8 0 AGEVPVAFW RTEGSEISEN EIKQFVAKEV VFYKRICKVF FADSIPKSPS GKILRKDLRA 54 

OKLAAGIPSSN TTQSKS 556 
<212> Type : PRT 
<211> Length : 556 

SequenceName : 4CLpep (SEQ ID NO : 4) 
SequenceDe script ion : 

Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

cggcacgaga tctcccacga ctaatttaga agaagattta cttagtctct gcttctcgct 60cgatcgccg 
g ccggtgaggt agctagctag ctactcgtac tagaccatta ccatgggttc 120cgtgccggag gagtcag 

tgg tggcggtggc accggcggag acggtgttcc ggtcgaagct 180ccccgacatc gagatcaaca acgag 

cagac gctgcagagc tactgcttcg agaagatggc 240cgaggtcgcg tcccgcccct gcatcatcga egg 

ccagacg ggcgcctcct acacctacac 300ggaggtcgac tccctgaccc gtcgcgccgc ggcggggctg c 

geegcatgg gcgtggggaa 3 60gggcgacgtg gtgatgaacc tgctgcgcaa ctgcccggag ttcgccttct 

ccttcctggg 420cgcggcgcgg ctgggcgccg ccaccaccac cgccaacccg ttctacaccc cgcacgag 

at 480ccaccgccag geggaggegg cgggcgccaa getgategtg accgaggcct gcgccgtgga 54 

Ogaaggtgctg gagttcgegg eggggegggg cgtgcccgtg gtcaccgtcg aegggaggeg 600cgacgggt 
gc gtggacttcg eggagctgat cgccggcgag gagctgcccg aggeggaega 660ggccggggtc ctcccc 

gacg acgtcgtcgc cctgccctac tcctccggca ccaccgggct 720ccccaagggc gtcatgetea ccca 

ccgcag cctcgtcacc agcgtcgccc agctggtcga 780cgggtcgaac cctaacgtgt gcttcaacaa gg 

aegacgeg ctgctgtgcc tgctgccgct 840ct 

tccacatc tactcgctgc acaeggtget gctggcgggg ctccgcgtcg gcgccgccat 900cgtcatcatg 
cgcaagttcg acgtcggcgc gctggtggac ctcgtccgcg cgcaccgcat 960caccatcgcg ecattegtg 

c cgcccatcgt cgtggagatc gecaagageg accgcgtcgg 1020cgccgacgac ctcgcatcca teegcat 
ggt gctctccggc gccgcgccca tgggcaagga 1080cctccaggac gecttcatgg ccaagatccc caacg 
ccgtg cteggacagg ggtaegggat 1140gaccgaggct gggccggtgc tggccatgtg cctggcgttc gec 
aaggagc cgttcaaggt 1200caagtccggg tcgtgcggaa ccgtggtgcg caacgccgag ctcaaggtcg t 
cgaccccga 12 60caccggcgca tccctcggcc ggaaccagcc tggegagatt tgcgtccggg ggaagcagat 
1320catgataggt tacctgaacg acccagagtc gaccaagaac accatcgaca aggaeggctg 13 80g 
ctgcacacc ggagacatcg gcttggtgga tgacgacgac gagatcttca tegtcgacag 1440gctcaaggag 
atcatcaagt acaagggctt ccaagtggcg ccggcggagc tcgaggccct ISOOcctcctcacg aaccegga 
gg teaaggaege cgccgtcgta ggggtgaagg atgatctctg 15 60cggcgaagtc ccggtcgcct tcatta 
agag gatcgaagga tctgagatca acgagaacga 1620gatcaagcaa ttcgtctcaa aggaggttgt tttc 
tacaag aggatcaaca aggtctactt 1680cacc 

gactcc attcccaaga acccttccgg caagatccta aggaaggact tgagagccag 1740gctcgccgct gg 
catcccca ccgaagttgc cgcgccgaga agctaagggc cgcttctcag 1800gaacgcagtc acccatggtg 
ctgtttaggt gctgttatag accacaccaa atggggaaag 18 60aaactacggg aggggatcat attattgtt 
g caggagatat cagtttgttg attcgccctg 1920cttgtgtaat gttgataaaa tgaaatgata taataga 
tgt gttgttttat tttttgacca 1980tgtaagaaca aggctgtttt atacactact tattttttga aaaaa 
aaaaa aaaaaaaa 203 8 

<212> Type : DNA 
<211> Length : 2038 

SequenceName : 4CLCDNA (SEQ ID NO : 5) 

SequenceDescription : 

Custom Codon 



Sequence Name *: 4CLCDNA (SEQ ID NO : 5) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MGSVPEESW AVAPAETVFR SKLPDIEINN EQTLQSYCFE KMAEVASRPC I IDGQTGASY 6 0 TYTEVDSLT 

R RAAAGLRRMG VGKGDWMNL LRWCPEFAFS FLGAARLGAA TTTANPFYTP 120 HE IHRQAE AA GAKLIVT 

EAC AVE KVLE FAA GRGVPWTVD GRRDGCVDFA ELIAGEELPE 1 8 0 ADEAGVLPDD WALPYSSGT TGLPK 

GVMLT HRSLVTSVAQ LVDGSNPNVC FNKDDALLCL 240 LPLFH I YSLH TVLLAGL RVG AAIVIMRKFD VGA 

LVDLVRA HRITIAPFVP PIWEIAKSD 3 0 0RVGADDLAS I RMVLSGAAPM GKDLQDAFMA KIPNAVLGQG Y 

GMTEAGPVL AMCLAFAKEP 3 60FKVKSGSCGT WRNAELKW DPDTGASLGR NQPGE ICVRG KQIMIGYLND 
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PESTKNTIDK 42 ODGWLHTGDIG L VDDDDE I F I VDRLKEIIKY KGFQVAPAEL EALLLTNPEV KDAAWGV 

KD 4 8 ODLCGEVPVAF IKRIEGSEIN ENEIKQFVSK EWFYKRINK VYFTDSIPKN PSGKILRKDL 54 

ORARLAAGIPT EVAAPRS 557 
<212> Type : PRT 
<211> Length : 557 

SequenceName : 4CLpep (SEQ ID NO : 6) 

SequenceDescription : 

Sequence 



<213> Organ ismName : Lolium perenne 
<400> PreSequenc eSt ring : 

ggcacgagga atcctaccaa accgagctac cagatccttc tctactaatc gagctcccta 60cgctgctcc 
g cctgtcttcg tttccgcctc accgccggcc ggttctccgc tccaagctac 120gtccgtccgt ccacata 

tat agcatcgaca tgaccatcgc cgaggtcgtg gctgccggag 180acaccgccgc cgcggtggtg cagcc 

cgccg ggaacgggca gaccgtgtgc gtgaccggcg 240ccgccgggta catcgcgtcg tggctcgtca age 

tgetget ggagaagggg tacaccgtca" 300agggcaccgt caggaaccca gacgacccga agaacgegea c 
ctgagggcg ctcgacggcg 3 60ccgccgaccg gctggtcctc tgcaaggccg acctcctcga ctacgacgcc 

atccgccgcg 42 0ccatcgacgg ctgccacggc gtcttccaca ccgcgtcccc cgtcaccgac gaccccga 

gc 480aaatggtgga geeggeggtg aggggcaege agtaegtcat agacgeggeg gcggaggccg 54 

Ogcacggtgcg gcggatggtg ctcacctcct ccatcggcgc cgtcaccatg gaccccaacc 600gcgggccg 
ga cgtggtcgtc gaegagtegt getggagega cctcgacttc tgcaagaaaa 660ccaggaactg gtactg 

ctac gggaaggegg ttgeggagea ggeggcateg gagttggcgc 720ggcagcgcgg cgtggacctt gtgg 

tggtga acccggtgct ggtgatcggc cccctgctgc 7 80agccgacggt gaacgccagc atcggccaca tc 

ctcaagta cctggacggg tcggccagca 840ag 

ttcgccaa cgccgtgcag gcgtacgtgg acgtccgcga cgtggccgac gcccacctcc 900gcgtcttcga 
gtgcgccgcc gcgtccggcc gccacctctg cgccgagcgc gtcctccacc 960gcgaggacgt cgtgcgcat 

c ctcgccaagc tcttccccga gtaccccgtc cccaccaggt 1020gctctgatga gacgaacccg aggaagc 
age catacaagat gtcgaaccag aagctccagg 1080acctcggact cgagttcagg ccggtgagcc agtcc 
ctgta egagaeggtg aagagcctcc 1140aggagaaggg ccaccttccg gtgctcagcg agcaggcaga ggc 
ggacaag gaaaccctag 1200ctgccgagct gcaggcaggg gttaccatcc gagcatgagg aacaagaaat c 
aaccatgtc 1260catactgcta ctgtcatgta aaccagctgt tgaatgecta aaatctaagt tcttgtaata 
1320ctgtgttgtt tcatgtggac tagattgatc gaataaacat ctctacacaa ggttgctaaa 13 80a 
aaaaaaaaa aaaaa 1395 
<212> Type : DNA 
<211> Length : 1395 

SequenceName : CCRlcDNA (SEQ ID NO: 7) 

SequenceDe script ion : 

Custom Codon 



Sequence Name : CCRlcDNA (SEQ ID NO: 7) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MTIAEWAAG DTAAAWQPA GNGQTVCVTG AAGYIASWLV KLLLEKGYTV KGTVRNPDDP 60KNAHLRALD 
G AADRLVLCKA DLLDYDAIRR AIDGCHGVFH TASPVTDDPE QMVEPAVRGT 120QYVIDAAAEA GTVRRMV 

LTS S I GAVTMDPN RGPDWVDES CWSDLDFCKK TRNWYCYGKA 1 8 OVAEQAASELA RQRGVDLVW NPVLV 

IGPLL QPTVNAS IGH ILKYLDGSAS KFANAVQAYV 240 DVRDVADAHL RVFECAAASG RHLCAERVLH RED 

WRILAK LFPEYPVPTR CSDETNPRKQ 300 P YKMSNQKLQ DLGLEFRPVS QSLYETVKSL QEKGHLPVLS E 

QAEADKETL AAELQAGVT I 3 60RA 

362 

<212> Type : PRT 
<211> Length : 3 62 

SequenceName : CCRlpep (SEQ ID NO : 8) 

SequenceDescription : 

Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

ggcacgagca acaagtcatc aatggcggaa ggcttgeegg cgctcggttg ggctgegagg 60gacgcctcc 
g gtcacctctc cccttacagc ttctcgagaa gcgttccgaa ggacgacgat 120gtgacgatca aggtget 

ctt ctgegggate tgccacactg acctccacat catcaagaac 180gactggggca acgccctcta cccca 

tegtc ccagggcatg agatcgtggg cgtcgtcgcc 240agcgtcggca gcggcgtcag cagcttcaag gec 

ggegaca cggtgggcgt gggctacttc 300ctcgactcct gccgcacctg ctacagctgc agcaaggggt a 
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cgagaactt ctgccccacc 3 60ctgacgctca cctccaacgg cgtcgacggc ggcggcgcca ccacccaggg 

cggcttctcc 420gacgtcctcg tcgtcaacaa ggactacgtc atccgcgtcc cggacaacct gcccctgg 

cc 480ggcgcggcac ctctcctctg cgccggcgtc acagtctaca gccctatggt ggagtacggc 54 

Octcaacgccc ccgggaagca cytcggcgtc gtcggcctgg gcgggctcgg ccacgtcgcc 600gtcaagtt 
eg geaaggcett egggatgace gtcaccgtca tcagctcctc ggacaggaag 660cgcgacgagg egcteg 

gecg cctcggcgcc gacgccttcc tegtcagcag cgaccccgag 720cagatgaagg eggeggeggg cacc 

atggac ggcatcatcg acacggtgtc cgcgggccac 780ccgatcgtgc cgctgctcga cctgctcaag cc 

catggggc agatggtcgt ggtgggcgcg 840cc 

cagcaagc cgctcgagct cccggccttc gccatcatcg gcggcggcaa gcgcctcgcc 900gggagcggca 
ccggcagcgt cgcacactgc caggccatgc tcgacttcgc gggcaagcac 9 60ggcatcaccg ccgacgtcg 

a ggtcgtcaag atggactacg gtcaacaccg ccatcgagcg 1020gctagagaag aacgaegtea ggtaccg 
ctt cgtcatcgac gtcgccggca gccacctgca 1080gggcaccgcc gcttaacttg tgctacacaa tgtgg 
aegeg cgctcgtttg gtccagaaaa 1140aggttcgccg gctcacagcc acatgaacaa gtcaatgagt cgt 
tggtgtg ttgtttatct 1200tcattccaca tatgggaege agttccagat tttcatgtca aataattgcg t 
cgtgtgcgg 1260ttgtcaagac tcaaatagga gaaaaaaaga ctcgtgattt cgttttgcaa aaaaaaaaaa 

1320aaaaa 1325 
<212> Type : DNA 
<211> Length : 1325 

SequenceName : CADlcDNA (SEQ ID NO: 9) 

SequenceDescription : 

Custom Codon 



Sequence Name : CADlcDNA (SEQ ID NO: 9) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MAEGLPALGW AARDASGHLS PYSFSRSVPK DDDVTIKVLF CGICHTDLHI I KNDWGNAL Y 60PIVPGHEIV 
G WASVGSGVS SFKAGDTVGV GYFLDSCRTC YSCSKGYENF CPTLTLTSNG 1 2 0 VDGGGAT TQG GFSDVLV 

VNK "DYVT RVPDNL PLAGAAPLLC AGVTVYSPMV EYGLNAPGKH 1 8 OXGWGLGGLG HVAVKFGKAF GMTVT 

VISSS DRKRDEALGR LGADAFLVSS DPEQMKAAAG 240TMDGIIDTVS AGHPIVPLLD LLKPMGQMW VGA 

PSKPLEL PAFAI IGGGK RLAGSGTGSV 3 0 0 AHCQAMLDFA GKHG I TADVE WKMDYGQHR HRAAREERRQ V 

PLRHRRRRQ PPAGHRRLTC 3 60ATQCGRALVW SRKRFAGSQP HEQVNESLVC CLSSFHIWDA VPDFHVK 

407 

<212> Type : PRT 1 
<211> Length : 407 

SequenceName : CADlpep (SEQ ID NO: 10) 

SequenceDescription : 

Sequence 



<213> OrganismName : Lolium prenne 
<400> PreSequenceString : 

ggcacgagtc gcctccaacg tcttccctta accggccgtc cctacgcttg caccaccacc 60acgcacaga 
c agagcagttt cccagccccc gccggaaccg gatggcaccc aeggeggegg 120agcagacgga gcaccac 

cag cacaccagga aggcggtggg gctggcggcg cgcgacgacg 180ccggccacct ctccccgctc gecat 

cacac ggaggagcac aggagacgac gatgtggtga 240taaagatttt gtactgcgga atctgccact ctg 

acctgea cgccctgaag aacgactgga 300agaactcaag gtacccgatg atccccgggc aegagatege c 

ggcgaggtc acggaggtgg 3 60gcaagaacgt gagcaagttc aaggccggcg accgcgtggg cgtcgggtgc 

atggtgaact 420cgtgccggtc gtgegagage tgegacaagg gcttcgagaa ccactgcccg ggcatgat 

cc 480tcacctacaa ctcggtcgac gtcgaeggea ccgtcaccta cggcggctac tccagcatgg 54 

Otggtggtgca egageggtte gtggtccggt tccccgacgc catgccgctg gaeaagggeg 600cgccgctg 
ct gtgcgccggc atcaccgtgt acagccccat gaagtaccac gggctcaacg 660ttcccgggct gcacct 

egge gtgctggggc tgggeggget gggccacgtt gcggtcaagt 7 20tcggcaaggc etteggaatg aaag 

tgaegg tgatcagctc gtcgccgggg aagaaggagg 780aggccctggg gcggctgggc gccgacgcgt tc 

at cgt cag caaggacgcc gacgagatga 840ag 

gctgtgat agcaccatgg atggcatcat aaacacggta tetgeaaaca tccccctgac 900ccctctcttc 
gggctgetea agcccaacgg caagatgatc atggtcggcc tccccgagaa 960gcccatcgag attcctccc 

t tegctctagt tgccacgaat aagaccctgg cegggagcat 102 0catcggcggc atgagegaca cgcagga 
gat gctggacctc geggegaage aeggegtgae 1080ggccgacatc gaggtggtcg gcgcggagta tgtga 
acacg gecttggage gccttgccaa 1140gaacgacgtc aggtatcget tegtcatega catcggcaac acc 
ctcgaca atgttgcggc 1200caccaccgag tgaaegtact cagcactgct tacgatctac gttgttccac t 
gttagtgct 1260ccgtagtaaa caataaacga tcaaaactct tgtcatctgg tgcattggtg tagacatggt 

1320tgtttgcgag gaaactgagt tgaaggatgg atggataaaa aaaaaaaaaa aaaaaaa 1377 
<212> Type : DNA 
<211> Length : 1377 



6 



WO 01/95702 



PCT/AU01/00699 



SequenceName : CAD2cvEllettcDNA (SEQ ID NO: 11) 
SequenceDescription : 

Custom Codon 

Sequence Name : CAD2cvEllettcDNA (SEQ ID NO: 11) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

MAPTAAEQTE HHQHTRKAVG LAARDDAGHL SPLAITRRST GDDDWIKIL YCGICHSDLH 60ALKNDWKNS 
R YPMI PGHE I A GEVTEVGKNV SKFKAGDRVG VGCMVNSCRS CESCDKGFEN 120 HCPGMI LTYN SVDVDGT 

VTY GGYS SMWVH ERFWRF PDA MPLDKGAPLL CAGITVYSPM 180 KYHGLNVPGL HLGVLGLGGL GHVAV 

KFGKA FGMKVTVISS SPGKKEEALG RLGADAF I VS 2 4 OKDADEMKAVI APWMASXNTV SANIPLTPLF GLL 

KPNGKMI MVGLPEKPIE IPPFALVATN 3 00KTLAGSIIGG MSDTQ EMLDL AAKHGVTADI EWGAEYVNT A 

LERLAKNDV RYRFVIDIGN 3 60TLDNVAATTE 

370 

<212> Type : PRT 
<211> Length : 370 

SequenceName : CADcvEllettepep (SEQ ID NO: 12) 

SequenceDescription : 

Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

gcagcggttn caaatcgccg gtcctggggt ggaagtgnag cagtgggaag atgtgtgcga 60ggggttgtg 
t tttggatgna agacaggcgg gccagtggag aacaagagag aacgcgagag 120gccaaagtat ccgcagc 

ccc gcaaacaagg cctagatttg ggttaagttt gggtcgtctc 180agacaccgcg gccatccttt taggt 

ggtcc gcgcgctgga ccgtattttt atctgagttg 240acccattcag acgcgcagac acgagatgga tgg 

tgcagtw agagatgacc taagtacaar 300aacctctccc cgagctgccg ccatccgtca cttaccgagc g 

acaaagctt cccacttcat 3 60cacactcagc ccagcaagca tactgatggt gagcgcactc gcggctgtgc 

ccaccgaccc 42 0cacgccatcc aaaaccaact ctacttttca ccmcaccaac aaaagacaaa atatggtg 

ga 480ttttgtgatg agatggaagc ggagcttgtc agaatgggaa acgcataaat cgagaacacg 54 

Otatacagtgc tggaaattgg atgactaagc cccaagggtt agaaaaaaaa tnagaccatg 600tctagatg 
ga attagacatt ttttgatata atagaagcgg gacttggcgc gacaatttca 660aacttcgtcc ctaaca 

ggta tcgaactttc gatagttagc gtgtgctact gcggaccccc 720aaccacttgt gttaagccca catc 

ggttaa ggcccaaggg ttagatgaaa gtaccaatct 780cactcatttg cgactagcta caaaacttgc tt 

ttcacatg tacggtcata ctacaatttt 840ga 

ccttggta acgtaagtat ggactgtatg gtgtgctaag gtgtgttggc agctcaaata 900aacccaaaaa 
tttcaacaca cgtcaaccat gaactgagat tcacaccaac ggctgagccg 960tctcctttaa aagatagag 

g gagaaaacca taatcaccat tggtggtcat gtgtgagtgt 1020gcaagcaaaa aaaaatggag aagccaa 
aac ccgttgagag agtgcgagag catacaagaa 1080caccacaaca aagtgtgaag gagaaaaaga atatg 
agata agatttcgga aatacttttg 1140cacacccatg catgggtgtg ggtgtttccg tcaccgtcta tgt 
atttctc gaaattcatg 1200cccaccatgg tagataaaaa tatbtttttc tctctcctct ttttattcaa a 
tctcaaagc 12 60ataakrartg gtgacagaac gataagattc ctacctagct ttctgagatc ccactagttt 
1320atcttcaagc tggtgattga aggattaacc atgcttgaat tagattggct tcaaacttgg 1380t 
agtagcttg tttcatactt tgattacttt ggtatggtta gttggtttga gattttggtc 1440aatgtagaat 
cagatttgag agcgattgtc agcttgaatt gccgcagttt tagcacatac 1500tagtttggat agatgaac 
ag tttggagaga caaataatgt ctatacgagc tcatcggata 15 60atattagtct atggcttttg cttcgg 
tgtc ccctctgcaa actttacccc tctgtagatg 1620gtaggatttt ctgatatcct ttcatggttt aagg 
gtgtgc gtgtaaggaa cgggagatac 1680cgga 

tcacac cttttcgtct acactttaca agcatgtaac acctaagatt gattgatatc 1740taggcttaca cc 
ccaatgga, ggtaaactaa tattattgaa atgcgacttt tcaaaagtcc 1800caatataacc ttgacgatga 
tcttacaact actcgcgcca gtcttgtatg atatcagatt 1860ggccgaggat cgtgggtacc ttgtagtgg 
a ctatgatgct catggaggtt gtatggacat 1920gttgtaatgc tggttttctc taggt ttttt ctaatca 
act tggcattctt ctccttaaca 1980cataataaga gggaatacct ccatacatta ttctgaaaaa agcat 
ggcca acaatgaaac 2 040agaaacaagt acgacagtct atacccgacc caaacaatgg ctcaggtctt tea 
egatgea 2100tagtttgtta gcatgtattt tatagtagga actaaaattt aaagacaact tgenaaaaca 

2160attttgtctc ttgagtgttt tttaaggatg eggcatttat cgattataca ttacatatgt 2220gat 
tggatta gecaactttt tgtcttccga tgatcatatg aaagggttgt atcttagggc 2280atctccaatg g 
gnagactca aatgcaaaaa aatngtccgt ttgggtcttc cnggacaaaa 2340cctgctccca aeggggcaac 
ccaacttaaa aacggacagg tgcagcgtcc ggcntgaccc 2400aaaactgacg caaatttggn anattttt 
9"g ggcnagccag aegaacgegg gcgtccactg 2460tatccgacta tgtccgcatc ctggcccatc tgacag 
tgac acaaaataca accacatgcg 2520cccccc 

accc ttctctctcc tccgttcgcc ttttcccatg gaancngtcc tcgctcctcg 2580ccggaattga tctc 
gectaa ccatgctccg ccgccaccct cgectkaagg ccccagccgc 2 640cgctacctcc tttttgtcag cc 
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ctattgga agtcgccgga gttgaaacga gcgccgccag 2700cctcgacacc gccgagcaag acgaagactg 
ggcggagctc gccgagacgg gacggggacg 27 60gagctcgcca tgcgtgcctc gcaggggcgc gatgggggc 
g gagctcgccg tggctggctg 2820cagcacctcg ggccgctgct agccgtgcca cgacgcgagc atgcgcc 
teg acgccgcccc 2880gtgctacctc gtcgcgcgcc cagggccgcc ccgcccctgc cgaccggcgg eggag 
aegeg 2940accttcgcgg acgtgcccgg eggcagagae gcgtccttcg cgacagcgcc ctcctcgatc 
3000tccgtcgagc cgcatacgcg gctaggaggg aegegggegt ccccggtgtc ggcctccgtt 3060gtggc 
gcatc gegggegegg cctccgtcga ggcgcatcgc gggcgtggcc tcgtggcgca 3120gcctgccctg att 
eggtctg aggegeggeg cggagcttcc tcgcggcggc gegggeggag 3180cctcctcgct gcggcgcgac c 
tgctctgcc geggtccgag acgcggcgcg ggcagagctt 3240cctcgcggcg gctcgggcgc ggcttcctcg 
eggegatgge gcttccaggc tcgcacgcgg 3300cctccggcgt ggegcagega gagcgcagcc tccggtga 
gt taggcacagg cgcgacacga 33 60catccccg 

gc ctcggcctcc ggcgtggcgc agegegageg cgacgtagcc taggttggca 3420actagtacta cgagga 
agaa agaggagaaa caattatttg ggtcacagcg ttgggcgtac 348 0tgtgcgatcc aaaeggacac cegg 
aegega aacgatgtca gcgtgtccgc gtggcgaccc 3540aaacgacccg aaacggacgt ccgtttgggt eg 
gtgcgttg gagatgeect tactccccat 3 600cctcaaatga gtctaattat atatcttgtt gtaagtttta 
aaaaagttaa actttgatca 3 660acattagtaa tgatagtagc aacgaataca aaattaaatt gtaaaaata 
t attatgaaac 3720tttattttaa gatggatcta gttatactaa ttttctgegg atggaggaag tagctaa 
ata 3780ttgttaattt ctaaataaaa aattaaaact ttaacttaaa acaaaagtta caagcataat 38 
40tatctgtgga tggaggaagt agctaagata caccaatcct ctctctacat tacctagcat 3900gccacat 
cag gaaactattt aggataagct ccaaggaacc acccagaaca acaatttaca 39 60tggcctggct aacct 
aatga caatttccga gcaactggtg gtggtggtac gcgttccttg 4020 ttcaattgtc tctattacaa gag 
tggcect gtataggtaa aaaaaaataa caagcttcca 4080aggacggcca tgttccttgt tcctgcaggc t 
geaegtact cacgacgaag tgtatctcgt 4140gttctggaca tttgtctcgc gcattttgta accatgaaat 

taaaaatgtg gtggcctgct 4200atatctgtat 
gggggtatca tgcactcctt cgcagaggaa tccagacgac gatttacacg 42 60tgtttccacc ttagctttt 
t ttaagtgtgt gtgtaaggaa cgatcatata actgcccctg 4320aatgctgcat atatataaac cgactcc 
ate atgtactcga gacaaggtcg tcaagaaaaa 4380caaactatgc ctatctcact agcaatgatt tgaga 
gtaca gettttcegg tgccatattt 4440tttcctatat atctttttct gaagaacaag aaaaaaaaaa cag 
ttggtgt ggtggttggt 4500gaagcgagaa agccccatat aagccctgct caccctcccc gcaaagcaca a 
ctcatagct 45 60cgggtctctc gctcacacca aaatcgccca ccagcaccag catctctcga teggcagacg 
4620catagatcga tgggctccac cgccgccgac atggccgcgt ccgcggacga ggacgcgtgc 4680a 
tgttcgccc tccagctcgc ttcctcgtcg gtcctcccga tgacgctgaa gaacgccatc 4740gagcttggcc 

tcctggagat cctggtggcc gccggcggca agtegctgae cccgaccgag 4800gtggccgcca agctcccg 
tc cgcggcgaac ccggaagcgc cggacatggt ggacegcata 48 60ctccggctgc tegegtegta caacgt 
cgtg acgtgcctgg tggaggaggg caaggaegge 492 0cgcctctccc ggagctaegg cgccgcgccc gtgt 
gcaagt tcctcacccc caacgaggac 4980ggcgtctcca tggeggeget cgcgctcatg aaccaggaca ag 
gtcctcat ggagagctgg 5040tgagtctctc ag 

tggagcta gttactgtag atccgaattc gttcccttta gtgagggtta 5100attccgcggc cgcgtcgac 

5119 

<212> Type : DNA 
<211> Length : 5119 

SequenceName : LpOmtlpromoter (SEQ ID NO: 13) 

SequenceDe script ion : 

Custom Codon 



Sequence Name : LpOmtlpromoter (SEQ ID NO: 13) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

tcccgtatct teaaegtgae acccctacac ttcctgcttg tcttggagat ttacacacac 60acggcaatt 
a ccaggagtat cttcctagat tatttttttc gataaggatc ttccagatat 120agcatgtgaa tctctgt 

act actactgttt gtcaagcaaa attaacattg acatcagtgt 180ttttgttggg ggcagcggaa tcttt 

gaege ctcttcttgc ctctcaagac atgtcaccct 240cactagttag tgtgccagct ggtagtacta cgt 

aegatge tccctccctc cgtaattatt 300caaccttttt gctctctctt tttataaagt caaacctttt a 

aatctgacc agatatctgc 3 60taaaaaatta geagacatge atacatcaaa gcagtagtcc tccctccgtt 

taaaattacc 420tgggtttatt caaataaagt caaactctgt aaaattcaat taaatattta gaaaaatc 

ta 480acagcacctg tagtataaaa gtatgctccc tctgtttgta aaaaagctaa gcaacttttt 54 

Otgagatacgg ataaatcttt agctaaaaca tgtctatata cctttgtatc tagataaagt 600tggaaagc 
tt ttttagaaac agacaaagta tgtgtttgac. attatgaatg ttgagtattt 660ttcctctaat cttgat 

caaa ttttacaaat tttggcttga atagagggac cattattagt 72 0atgaaactac ataaatttgt aaaa 

cactca acataattta cgatgggtca gtgatagcac 780taacttagct tttcataaat gccactgctt tt 

caatagag catgaagcag gacaaattta 840tt 

cgtgtgac ttgaatagag ggagcctgtt ctggttcaac tcaccctgca tgtgtgtctt 900catccctttt 
gctcttccta tctgtggtgt caattgagtg tcccacgtgc atgtgggcga 9 60aacttgaacc tagaaattg 

a catgctccca ctgcccggag eggagtatet ttgtgctttg 1020ttacccttat tgttgctacg tactaca 
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gtg tttagattgg aacttcataa tcaaaagaac 1080ttagtttcct acaatttttt gctaagcaat ataat 
gagca atcaaacttc tatatctgtg 1140gcaaataact aatccattat agttacagtt tagatgcaga cgc 
cagtgtt tcttcccctt 1200ttcggaaaaa agctattcca taataagtgt tggaaattta ataaatgggt a 
ctacgaatt 12 60tgaaaaaaaa agtgtcaaaa attcactaag aaagtacgta gtacaaattt aaactaagat 
1320tccgacactt attaggatcg gagagagtaa gtagcaaact actactccat ccacctaaaa 13 80c 
acgtgattt aactttgtct agatacggat agaaagttgg gatacatccg tatcttaaaa 1440aaaaacgcac 
ttattttaga cgaaggaggg agtatttcaa ccttgatttt aaacggaatc ISOOtacaaaggga atacatgg 
at tgtacaagtg ggctgaccgt atccattatg tactcgtact u 15 60ttgcagtttg aaagcaaagg ctagtg 
taat ttgtaggtgg ttctaggcgt ctagctgttt 1620catggcgtta tcacagccgt gccagtgtgc tcag 
ggccgt acataagttg cttggtgtat 1680gtgt 

cgatct aggatttgcc gtcttacaat tttgctttcc aacttatttt ctgtaaagag 1740atcgatgtga ac 
ttctctgt cgagtaaact gaaattgtct gaataaatat aactcggcag 1800attatgtttt atcgtttgca 
tgcgtaacag gctacacaaa ttgctcgagt cagcagcgag 1860ttgagctcac aacgaatcca tcagcaaaa 
a tactatacta tagtagcaca tcgtttcttt 1920tttcatgacg tttctgtttc ttcctaactt tccagga 
gca ccggagacga cgatgtggtg 1980ataaagattt tgtactgcgg aatctgccac tctgacctgc acgcc 
ctgaa gaacgactgg 2040aagaactcaa ggtacccgat gatccccggg cacgagatcg ccggcgaggt cac 
ggaggtg 2100ggcaagaacg tgagcaagtt caaggccggc gaccgcgtgg gcgtcgggtg catggtgaac 

2160tcgtgccggt cgtgcgagag ctgcgacaag ggcttcgaga accactgccc gggcatgatc 2220ctc 
acctaca actcggtcga cgtcgacggc accgtcacct acggcggcta ctccagcatg 22 80gtggtggtgc a 
cgagcggtt cgtggtccgg ttccccgacg ccatgccgct ggacaagggc 2340gcgccgctgc tgtgcgccgg 
catcaccgtg tacagcccca tgaagtacca cgggctcaac 2400gttcccgggc tgcacctcgg cgtgctgg 
gg ctgggcgggc tgggccacgt tgcggtcaag 2460ttcggcaagg ccttcggaat gaaagtgacg gtgatc 
agct cgtcgccggg gaagaaggag 2520gaggcc 

ctgg ggcggctggg cgccgacgcg ttcatcgtca gcaaggacgc cgacgagatg 2580aaggtaggcg gacc 
cgctgg ttcaggttac ttcccctgtc cggtgcagaa gaaagaggaa 2 640cttgagggtt catgtttgtt tt 
gcgttggt gatgtctttg caggctgtga tgagcaccat 2700ggatggcatc ataaacacgg tatctgcaaa 
catccccctg acccctctct tcgggctgct 27 60caagcccaac ggcaagatga tcatggtcgg cctccccga 
g aagcccatcg agattcctcc 2820cttcgctcta gttgccagta agtcttagga tctcttgcaa taaggag 
aaa tcatgcactg 2 880atcgatcaga gaaatgagat agcatcctga tgaacattgt acgtgtgtgc agcga 
ataag 2940accctggccg ggagcatcat cggcggcatg agcgacacgc aggagatgct ggacctcgcg 
3000gcgaagcacg gcgtgacggc cgacatcgag gtggtcggcg cggagtatgt gaacacggcc 3 060ttgga 
gcgcc ttgccaagaa cgacgtcagg tatcgcttcg tcatcgacat cggcaacacc 3120ctcgacaagg ttg 
cggccac caccgagtga acgtactcag cactgcttac gatctacgtt 3180gttccactgt tagtgctccg t 
agtaaacaa taaacgatca aaactcttgt catctggtgc 3240attggtgtag acatggttgt ttgcgaggaa 
actgagttga aggatggatg gataagtttg 33 00cttcttgccg tgttaatgga ttacctactt agcttcac 
tg caattaacaa attaagaaac 33 60gacacacc 

ca aaagactttc gtcagttttc ttggattata caagtcgtta tggttgggtg 3420tcagtgtgtc acagat 
aatc atactatggt atttaacctg gaagatcgtt tttttggcgg 3480caactcagtg ggttttccca ctat 
gtatat ttataaatat tcaacaagtc atgaggtaca 3 540aagggttgtt gctagaggat agcaacaaga ag 
ctagccaa aagatcatag gcttaaaaaa 3 600gagagaaaag aaaacaaaac tgctatagtt atcgaaatct 
ctcagctcaa attttaaaac 3660cagcataaga ctttctagaa gccttatgaa caagaagagc tagctcatc 
t ttaaaccttt 37 20tcctgcatct gtaaagattg agggtgcaac ccttgaatat aaaatcattc ctgtcat 
cca 3780gatagactat gtagtcaaaa tagtcatttc catgaagaag ggcactttta atacattttt 38 
40gagacttggt atgatactct gaatgtcaac accctggaag atcttttcac tcctatggaa 3900ggacaag 
aaa gcatttcaac tccttttact aaggaagaga ttgacaaggt gattcagaga 3960attcctttag acact 
ataga aagtcacaag gtgccaacgg cgcaatcctg tgccgacggc 402 0tttttatcgg ggaagccagc ate 
ggtaccg agaceggcag cccaccaact aggcegtegg 4080cacacatcct ccagtgtcgg cggccaacat c 
ggcataagt tggcccgttg ggcatcaact 4140cccccgtcgg aacaggtcta gcgcatggac cgtcgtgatg 

gcggcggcaa cgacgtcatc 4200ctatgccgac 
ggcctagccg tcggcctagc ttgccagcgc tatgecgacg tcacattgcc 42 60atcggcacat gctagtttt 
t ttttcttttt tctacatgcc aaattgtata tgtatatata 4320ctcatttact tattacttcc aattatt 
tta atgtgtatat attttgetea ccaattgtac 43 80gaatttgtac cctccgagaa attgetaaaa tgatg 
gagtg acctacaacg agecttggat 4440atgtgagttc ttcttgcccc attgeacaaa aattgtaaat att 
agggttt actggatcca 4500ctagttctag agcggccgcc accgegggga gctccagctt ttgttccctt t 
agta 4555 
<212> Type : DNA 
<211> Length : 4555 

SequenceName : CAD2cvBarlanogenomic (SEQ ID NO: 14) 

SequenceDe script ion : 

Custom Codon 



Sequence Name : CAD2 cvBarlanogenomic ( SEQ ID NO: 14) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 
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RSTGDDDWI KILYCGICHS DLHALKNDWK NSRYPMIPGH E I AGEVTEVG KNVSKFKAGD 6 0 RVGVGCMVN 

S CRSCESCDKG FENHCPGMIL TYNSVDVDGT VTYGGYS SMV WHERFWRF 1 2 0 P DAMPLDKGA P LLC AG I 

TVY SPMKYHGLNV PGLHLGVLGL GGLGHVAVKF GKAFGMKVTV 18 OlSSSPGKKEE ALGRLGADAF IVSKD 

ADEMK AVMSTMDGII NTVSANIPLT PLFGLLFALV 2 4 0 ANKTLAGS 1 1 GGMSDTQEML DLAAKHGVTA DIE 

WGAEYV NTALERLAKN DVRYRFVIDI 3 OOGNTLDKVAAT TE 

312 

<212> Type : PRT 
<211> Length : 312 

SequenceName : CAD2cvBarlanopep (SEQ ID NO: 15) 

SequenceDescription : 

Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

ggcacgagtc gcctccaacg tcttccctta accggccgtc cctacgcttg caccaccacc 60acgcacaga 
c agagcagttt cccagccccc gccggaaccg gatggcaccc acggcggcgg 120agcagacgga gcaccac 

cag cacaccagga aggcggtggg gctggcggcg cgcgacgacg 180ccggccacct ctccccgctc gccat 

cacac ggaggagcac aggagacgac gatgtggtga 240taaagatttt gtactgcgga atctgccact ctg 

acctgca cgccctgaag aacgactgga 3 00agaactcaag gtacccgatg atccccgggc acgagatcgc c 

ggcgaggtc acggaggtgg 3 60gcaagaacgt gagcaagttc aaggccggcg accgcgtggg cgtcgggtgc 

atggtgaact 420cgtgccggtc gtgcgagagc tgcgacaagg gcttcgagaa ccactgcccg ggcatgat 

cc 480tcacctacaa ctcggtcgac gtcgacggca ccgtcaccta cggcggctac tccagcatgg 54 

Otggtggtgca cgagcggttc gtggtccggt tccccgacgc catgccgctg gacaagggcg 600cgccgctg 
ct gtgcgccggc atcaccgtgt acagccccat gaagtaccac gggctcaacg 660ttcccgggct gcacct 

cggc gtgctggggc tgggcgggct gggccacgtt gcggtcaagt 72 0tcggcaaggc cttcggaatg aaag 

tgacgg tgatcagctc gtcgccgggg aagaaggagg 780aggccctggg gcggctgggc gccgacgcgt tc 

atcgtcag caaggacgcc gacgagatga 840ag 

gctgtgat gagcaccatg gatggcatca taaacacggt atctgcaaac atccccctga 900cccctctctt 
cgggctgctc aagcccaacg gcaagatgat catggtcggc ctccccgaga 9 60agcccatcga gattcctcc 

c ttcgctctag ttgccacgaa taagaccctg gccgggagca 1020tcatcggcgg catgagcgac acgcagg 
aga tgctggacct cgcggcgaag cacggcgtga 1080cggccgacat cgaggtggtc ggcgcggagt atgtg 
aacac ggccttggag cgccttgcca 1140agaacgacgt caggtatcgc ttcgtcatcg acatcggcaa cac 
cctcgac aatgttgcgg 1200ccaccaccga gtgaacgtac tcagcactgc ttacgatcta cgttgttcca c 
tgttagtgc 12 60tccgtagtaa acaataaacg atcaaaactc ttgtcatctg gtgcattggt gtagacatgg 

1320ttgtttgcga ggaaactgag ttgaaggatg gatggataaa aaaaaaaaaa aaaaaaaa 1378 
<212> Type : DNA 
<211> Length : 1378 

SequenceName : CAD2 c vBar 1 ano cDNA (SEQ ID NO: 16) 

SequenceDe script ion : 

Custom Codon 



Sequence Name : CAD2cvBarlanocDNA (SEQ ID NO: 16) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

cgggatcaac ttggatgtcc tttgcgggca cggtttcagg aacaacgaca catgcagcag 60ggatctcct 
c caaagactca cacaaaggtg acatgagcgc ccgctttttt gaagccaagt 120tggctaagaa atcgcaa 

age ttggtggagt cggccacctc aggatctgea acaaaaggca 180ccaagggagc tgccaacaca tcaac 

cacaa catcatgttc aaacgcagtc tcctcaagcc 240tcgaatgctc aaccgaaaga gaggcagaag ctt 

caacaaa aaactcagcc aacccaaagc 300cctcgacgtc atcagagatt aggctctgag gacccgcagg g 

aagcaacct tgtcaacaac 3 60cgcatccggc agaaaaggag caagacegga gcaaccctca agaggcacac 

gaaagacgtc 420gaagccaaga ggagacgagt cgcagggacg geggacagge gagaaggggc cgtagaac 

tc 480caagagctcg gcgtccctcg acctagcatc cgaagcactg accggggcac teaatgeata 54 

Oactttatctt gatggcatat gtactcaaac ccatacaatg ttcaccatgc attatctatg 600gaacattc 
ct tcatatacaa cttctgagtg gtcagtgcat aggaattttc attaacaacc 660aaaaacatac ttgggg 

ccta cacacacttt cacagcatgg aaaacttgtt agctttttaa 720agagttgcaa aatctgtcaa gega 

atgttc ttgtgataat tggaacgaag catgtttccc 780cattttcaat gtgtgtctct taccctaact ag 

cacccgac caacaaaatc tgaccatcct 840ag 

ttatatca tcatagagac ccacatgtag gttgaccccc ataacacttg tgtggatatc 900atggaaaatg 
gecttgatea acactttctt tcctacttgg tacaaatggt tatggactta 9 60ctcaattagt gctttagag 

a gctttggctg cagactttgt agcttcccaa tattcatagg 1020tccctccgga gtgggcagcc ccatcta 
cat aggctcaaaa ccagattttt gtaacatgtt 1080agacactttc aacttcatca tagaccatca aggag 
ctggc atgtgacagt gatatatgta 1140tcaattaccc attcaacacg aatagcttgc teatgeatgg tta 
gtcttgc ggcggcgggg 1200cgggaccatc gaacacaccg ccgggcggtc agtaggctag ggttagataa a 
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atctagccg 1260ttttcattca aacttgtgat atataatcaa atttaaataa aaacctttat tttcgtgcat 
1320ttttatttat ttgagggcgt gtttggggga cacggctgga aagtgacatc cccaaacact 1380g 
cacgaagaa aacgcgtcgc caaaaaattc gatccggcgt cagtcctttg ggagacgatt 1440tggatgacgc 
ggctagagat gctctaagtt ctccacgcca tgtttctttc tatatataca 1500cacagcccaa ggtccatg 
aa aagtaaaacg gcacgacgac acgcaccggc gacaacttca 15 60cattacggca catcgctatt acggac 
caca tacaactcca ccgctattct cagccaagtc 1620atacatgaca tgatccaatg gacgactttg tgag 
cgaaac tagaaccttg cggggtttag 1680attt 

tccaat gtggataagt tgtacgcgcc gactagcttt acacttggtt gaaaaaagct 1740tattgtagca eg 
acttctca ctgacatagg aatgtaaaca gtctctccac gccatgtttc 1800tttctagtag tagcatacta 
gtagtaactt ctctttgtcc tacacacacc cagggtccaa 1860gaaaggaaaa cggcacgacg gcacccacc 
g acgacgacga ctccacatca cggttcggta 1920aaaaaagtca aaactcgctg acgtggcacc aceggtc 
gca gtcaactgac gcgctcctct 1980gcgcaggtyt cacttcaagt ttcacctacc actgtgggcc caccg 
ccaat gtgggccccg 2040cgagcttctt actcactgac ctgtctccca ccagcctcct egceggtata tta 
ccccggc 2100ccccaatttc ctctgccttc ccacgagcag cagccggagc acggaatccc ggccgccatt 

2160cctccacctt cagctccgcc caaagatttc catccggcga gatccatggg ctccatcgcg 2220gcg 
gaegege ctcccgcgga gctggtgttc cggtccaagc tcccggacat cgagatcccg 2280acccacctga c 
getgeagga ctactgcttc cagcgcctgc cggagctctc cgcgcgcgcc 2340tgcctcatcg acggcgccac 
gggcgccgcg ctcacctacg ccgacgtgga cgccctcacg 2400cgccgctgcg ccgcgggcct ccgccgcc 
tg ggggtccgea agggegaegt cgtcatggcg 2460ctgctccgca actgccccga gttcgccttc gtgttc 
ctcg gcgccgcccg gctcggcgcc 252 0gccacc 

acca ccgccaaccc gttctacacg ccccacgaga tccaccgcca ggcgaccgcc 2 580gccggggcca gggt 
catcgt caccgaggcc tgcgccgtcg agaaggtgcg cgccttcgcc 2640gccgagagag 

2650 

<212> Type : DNA 
<211> Length : 2 650 

SequenceName : 4CL2promoterseq (SEQ ID NO: 17) 

SequenceDescription : 

Custom Codon 



Sequence Name : 4CL2promoterseq (SEQ ID NO: 17) 
Sequence 



<213> OrganismName : Lolium perenne 
<400> PreSequenceString : 

tcgacgcggccgcgtaatacgactcactatagggcgaagaattcggatcatatggattcgacactggaatttactcccatcg 
ggagcgtgcaaacaaaaaggtgttatagcaagaagacactggcaacattgccagcacagaatttgttacaatcatagaaagt 
tttatgacaggacattgtttcaaccgaaagcaagattacaacaatataatcaagggcttgggtctggttggacatgctcggt 
ccaatggacgatttatttgccgagaccagctcaaggagttgacgagcacacttaagcgccgagatcttaaaggcacccaagt 
caacaagtcgcccatcttgctcttttggcagctccttggacatctcttcgatattggctttgaagccatgacccatcataag 
ctgaaaggctaggagggcaccataggtacgcgaagtacgtttgaatacctcgaggacctccctcgtgttgatggcgaaagca 
tcgatcagctgccccaaggtcttgttttgatcgatcttggggaagatcatcgagtgcatccgcgtcatggatcctttaccct 
tctgaaggaggtcctgaaaaagctggtgagacccgagggtcattgacaaagcattcgccggagaattattcggcaatttatc 
tagagcctcagcagggatgtaggcagcttctggagaaagtgaaagaggaggagctcactaaccaaaatcaaatcgataaagc 
aaaaatcggaaaggaggccaaaaggggattactgagcaaggccaaggaagattggcgaaggagctcatctttttcaatcgcc 
cgagcttcggcagcaagcctggatgcctcttcatccttcagcctctttcttagcccctcgagctcatccttaaaggaatcaa 
cctcctggcgggcctcggcagctatctttatcgcaccctccagcttcgaggaagaagactcgacctccttttgcagccgagt 
cttgtcaacttccagagaagtgtattgggaggcgaaggcc 

tccagagaagagataacagctcacaaatccttaagagataaggaaaaataattagacgaagaactggttgtcaacaaactta 
taatttgatcagggaaatcgtcccacatggatatatcgttaaaacaggaaaagcttacaggtttccctggaggagaagctgt 
aaccacggcagtcaaagaaatctccttccctttggaaagggaagaagttgtcgatatttgagccatgggggctgcggcagga 
gtcgaagcctcggaagcggctggattcggcacgatggcaccagatttggccttcttggccggaggctcgatgaagccatctt 
cactgcaagaacaaaaaactagcgaagtcagaattcaatgcatatggcgaagttagaacacaatcctggaaaaggaagcaag 
gacttacaattcatagagaccatcttcatcggcaaagccgccggatgatctctttggaggtagtgcctcggccttttccgta 
gctgcatcaacaaaggcagcacgatcagcatcgtcatcatgcattgaccccgctgtatcgctcatatcatcggcagagaatc 
gaggattgatggaaaaagcctcaggattcatcggatcatcatgttgatctatcgggcttgcattccctagagtatgggaccc 
tacaaggactaaggaatcccttttcttggaaaaattgttcgacaggtcttgcaaacgttcaagagccgtaaggatctgtcgt 
agttgacgagtgagaataatggcagttaaaataatcaaaggaacatgacaataagagcataaaggggaaatttacctcggtt 
ggcagatgaccagcgtcaaatggcggttgaggagatatcagtggaattgaatcttcctggctaaagagggtgagacaccgga 
cttcgtcaagcagttctttttcggataattcagcaatatttactctagtctcgtccctgggacccgaatacaaccacatcgg 
atgggtcctagacatgatcggctgaactcgatgttttaag 

aacacagcggctacctcagtacctatcatggtttgaccatcggattctttgatccgaaggaatctatcaaataacttgtcta 
ctgttggtttttcatcgggtgagaggatatttttccaagacttcttgggcttgcttctagaacatcggagaattggggggga 
gctgggagtcggctgctgatgagtccttaatataaaaccacttcagcctccagccttgcacggattctttcatcgggaagtt 
gaagtagttgacttccttacgagcaacaaaaccaaccccaccaatgacgaaggacccaccactgctgttatatcttttcacg 
aagaaaatcttcttccacaaaccaaagtggggctcaatgcccaaaaacgcttcgcagagggtgataaagatggcaaggtgaa 
ggattgagttgggggttaacttccataattgaatctcatacactcgaaggaggtggtgaagaaatttgtgagcgggaagcga 
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aagacctcggtacaagaaggataagaacatcacagtaaaaccggcaggaggattgggccgtgaaattgcacctggaagaata 
acattcccctcgtcagaagaaattattacgaggctccgggccctcttttcatctcgcttcgtggtggtagaagctggccaat 
cgccagggataggcccggccgtggagcttgacggcgctggcggtgccggagctgagggaggagcatctggcgcgcttctccg 
cggcggattcgaaggagccctgacggtggtgccactgctcacggcgctggtggcgagagtgggattcttcttcttcaccatt 
gtgagatttgagggagatctgggagttgcgacggtggcgtggtagttgcaaacgaaaaggatgaatgaggaagaagggacgc 
aaggatgaagtgtggaaaggggagtttaccccaagagattataaagtgaaaggaaaacctgagaattgagcgggcacgtgtc 
gttgctctcaatttattgaggggattttttctcatcatag 

atcgcggaaatcgaggagtcaccttggtaactgcacgcaagtagtggtcatttcttaaacagaaccgcatagaagtaggatg 
ggaccgtcaggtcacgtcctatcagtcagatttacaacagtaattacatcatcactgacgtcaaagtatgcttgaagtatcc 
gaagaaaagtcgaaatttgggctcgaagactttcttgcagagaagcgcgtgaaaggaatatctaaggaaagggtcaaaacat 
tcggctcgagtctacgcacggattgcaagcatccgtacctagactcgggggctactcccatcgggagcgctggacgtgcacc 
cgataaatttagacgaggatgaaaaccggaaacccaagtgctactcccatcgggagcgccgattacgcacccgacaaacttt 
tttgcactccaggatcatgcccggggacttaattctgtgtagagtagcgttgttttgtcttcggcagttaaccagcaaagct 
ggacacgttactcaatatcctttacgcattaaacccttacttgaagaattgaagccccgatgcaaatatatcggatgaccta 
tgaaggcctgcggaaagcttcgggagaagaagacattcgagtggcacaacttgagtctacgaacggattgcaagcatccgta 
cctagactcgggggctactcccatcgggagcgctggactcgcacccgatagaaggagatgatgatattacaagaaggacaag 
aagtatcaagggagaagaacattcggtggaggcatgctttagtctctacccgaaaaaacttcggctagacactcggggggct 
actgacgtgggcattacccttcgggtaactgatattgccctatcctgtacgacccaactggaggcccatgaagacactcgaa 
ggcaaggtggaccactacgtcggtgccgaagggggttccttgaagaacaagacgaagaaaagaagaatacaagaaaagtata 
gaactaggatcttttgtaacctggtcgtacccggacagat 

ctctcgagacctggccccctacatatgggctaggagaggggctgccgagagggacacacacaatcttagcaattttagccac 
cataagtccagagcaaggtccccgtagaacttagcctctcgacgagatcacagccgaaaccttcggcaccccattgtaaccc 
gatattttcatagtcaagatcagacaggtaggacgtaagggttttacctcatcgagggccccgaacctgggtaaatcgctct 
ccccgcttgtttgataaccgatggcttgtgtcagcttacatgattccatctaccctaaacctcaaacggagggcattgccga 
ggagtaccctcgacattcccctccaccaatggtctcacataaattcaacaaagcaaactcataaaaagtttaatgagtttca 
gaaagaaataaaactaggcccctcctttgagaatctacgaatgattcaccatatcatctcgcagttagtgatgagtaactaa 
gtctcaaatttcccgacgcatggcgaaaaaggtagcgaacttaaaatgtgaggaatgaatgccacatatgcatggtgcatcg 
agtattctcattttagtcttggattactccctttagatgttgacaccatcccaaaaatacaacttggacaagttgttcattt 
cactagtatgaatttcagtaaatcgggcaatactccaacactcattcaccccctaggcgaggttagctcagatcaacgtcgg 
gtgtcttcatcgagttaatgtcgtcacacgcacacacacgtacgcgcacacacacgtgcgcaaacaaaaagaaaactaggaa 
ccttctcacgtagcctaggtcttgtcctgtaagaaaaaacccaggtccaccctagtttcgaaccaaaatatttttgaagata 
cattagtaagatatttttgaaaataaaaccgcaaaaagggaattgaaaaatatggactggctgttttgtccaaaaccacatc 
tttcggagaaccacgagggtatctattgatgggctcatac 

tatacctgggcatgtgttgggccaggcctcatgtcgggccgaggaaagcccgacgctgaaaaatcaggcccaagcttaaccc 
ggcccgaccaaatacccaccaaacccgttgggccatcaggttgcgggccgggcagtagtgtaaaacaccgatttcgggctac 
ataggcccggctcgtttgtcgggcaaacatttctagacctaagcccgagtttttcgggccgggctgcccatggccaggtata 
gctcataacgacgtatgacatttcgagcaattgatgcaaagcacgtgtagggttttatcccatccgtgtggcgtgtgtaggg 
tgtaaatgaataggataatttcctcgccgaaactggtcccaaattcgctttgaagtgtccatatatgattttaaagaatgtg 
acaaataaagatatccaatttcgaaatagtgctccggatacggtataggatatggtatagcaaataacatgctgatatggat 
tgtccgatattaaattaagataatccaaatgttttaaaccgcataattcgatttttgagtcaaaagcgaatgccaattcaga 
aggttagcagttattgagtttcaaaatttatttggcgagcatatctagttctaaattctatcacgtaaattgtgtctttttt 
taataactacacaagactaaaagtttaaatctctctcaagatttgcgaaaactatagctatctactgatatatatatccgac 
tatatttgttttcggaccgcatgcgtcctatttccgattcgaatctgcactccgatatatccacattgaatctaaaaccgat 
caatatttgctccgatctaaatccggaaaaatatgtggtgaaggatatggtataagcaaaatccgatttgatccatttgtac 
ctctaggcgtgtgcaagacctggaggaaagaatggcgcatctgtagggtgcagtcccaccggtggaaaatgtgagctcaccg 
tattgtcccccgatggagcatcgaaacggagtcggaacac 

gatttgcgccacgtacagagcatgcatgatttcccttgtatgcggtccaggatcttaaactgccttccatttccaggaacct 
accgattggctgcaagccgtagctagcggtttgaagtcacggcattgccgcccccgattaacccacccgtcgcgcgcgcggt 
cggtcgtttcaccgtcctgcctaggctacgcacgcgcgcgcgcagttgggccagttgtaggtaagccgactcgagatcacac 
acccggcctcacctactacctctcgccgtcgcggtcaccgtgtcacactcacgcccaggggagccacccgcccacacggcgc 
ctagctcatcccctctcactactcttcttctcctccctctcacctcgccgtcgacccagctcccggctctataaattccgca 
ctactcgaaccaacatcgcccaggcctttgccttttacgacgaatcctaccaaaccgagctaccagatccttctctactaat 
cgagctccctacgctgctccgcctgtcttcgtttccgcctcaccgccggccggttctccgctccaagctacgtccgtccgtc 
cacatatatagcatcgacatgaccatcgccgaggtcgtggctgccggagacaccgccgccgcggtggtgcagcccgccggga 
acgggcagaccgtgtgcgtgaccggcgccgccgggtacatcgcgtcgtggctcgtcaagctgctgctggagaaggggtacac 
cgtcaagggcaccgtcaggaacccaggcatgtcacccatgcattcatcattttcttactagtcgtatgcgttatgcgacttg 
tgtattaactattgtggactgcatgcagacgacccgaagaacgcgcacctgagggcgctcgacggcgccgccgaccggctgg 
tcctctgcaaggccgacctcctcgactacgacgccatccgccgcgccatcgacggctgccacggcgtcttccacaccgcgtc 
ccccgtcaccgacgaccccgtacgtactccatagaactcg 

gcacccctagcttctctccgttctctctgtatgtctgtcaccgtcgatcgccatggcagcacgcatgcatgcgcgcgcaacg 
ctagctagacgctgaccgactcattgtgcaggagcaaatggtggagccggcggtgaggggcacgcagtacgtcatagacgcg 
gcggcggaggccggcacggtgcggcggatggtgctcacctcctccatcggcgccgtcaccatggaccccaaccgcgggccgg 
acgtggtcgtcgacgagtcgtgctggagcgacctcgacttctgcaagaaaaccagggtgggtgctgcatgctcaatttttat 
tatcatagctaccctttttctgcaccatgctgcatttcttttccaaaaacaactctcaaaagatatgctacgtggtgagttc 
ctatagctgaattattacaactaccaccctatcgatcactaccgccctaaaagtgttcaacttttgaaggcaaccaaaacca 
atacatgaacgacgatcgtgtgcgcttgtcgtcgttatcattagcctctgtagctctaattttcacctatgtacgcatggat 
agacgattcggaaatacagttcagtttacctaccatatactatgccgaaatcgaacgcacacaggtgtgaggcagcagccgc 
tcacgagttatgcgccgaaaccgacatctcggaatcttcagtccacaatcaaaaaatagacacctggtaccactacaaaatt 
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whole of document 

GenBank accession AF033540, Heath RL et al, "Lolium perenne caffeic acid O- 
methyltransferase (OMT3) rnRNA, complete cds", 29 January 1999 

GenBank accession AF033539, Heath RL et al, "Lolium perenne caffeic acid O- 
methyltransferase (OMT2) rnRNA, complete cds", 29 January 1999 

GenBank accession AF033548, Heath RL et al, "Lolium perenne caffeic acid O- 
methyltransferase (OMT1) rnRNA, complete cds", 29 January 1999 

GenBank accession AFO 10291, McAlister FM et al, "Lolium perenne bispecific 
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Capellades M et al, "The maize caffeic acid O-methyltransferase gene promoter 
is active in transgenic tobacco and maize plants," Plant Molecular Biology, 
1996,31:307-322 
whole of document 
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claims numbered 20. To distinguish them, 19a and 20a refer to the first claims 
19 and 20, and 19b and 20b refer to the second claims 19 and 20. 
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Continuation of Box No: II, Observations where unity of invention is lacking 

The international search report has been drawn up in respect of the entire international application but the International 
Searching Authority is of the opinion that the application does not appear to comply with the requirements of unity of 
invention as set forth in the PCT regulations (Article 34(3), Rule 68(1) PCT). 



The separate groups of invention are: 

1. Claims 1, 2, 6 to 12, 16, 17,19a, 19b and 23 to 26 (partial) and claims 3, 13 and 21 (complete) are to 

4-coumarate-CoA ligase (4CL) from ryegrass (Lolium), the nucleotide sequence encoding it, the promoter from 
its gene and various uses of them. 4CL from ryegrass is considered to be the first "special technical feature". 



2. Claims 1, 6 to 1 1, 16, 17, 19a, 19b and 23 to 26 (partial) are to 4CL from fescue (Festuca), the nucleotide 
sequence encoding it, the promoter from its gene and various uses of them. 4CL from fescue is considered to be 
the second "special technical feature". 

3. Claims 1, 2, 6 to 12, 16, 17, 20a, 19b and 23 to 26 (partial) and claims 4, 14 and 22 (complete) are to 
cinnamoyl-CoA reductase (CCR) from ryegrass, the nucleotide sequence encoding it, the promoter from its 
gene and various uses of them. CCR from ryegrass is considered to be the third "special technical feature". 

4. Claims 1, 6 to 11, 16, 17, 19b, 20a and 23 to 26 (partial) are to CCR from fescue, the nucleotide sequence 
encoding it, the promoter from its gene and various uses of them. CCR from fescue is considered to be the 
fourth "special technical feature". 

5. Claims 1, 2, 6 to 12, 16, 17, 19b and 23 to 26 (partial) and claims 5 and 15 (complete) are to cinnamyl alcohol 
dehydrogenase (CAD) from ryegrass, the nucleotide sequence encoding it, the promoter from its gene and 
various uses of them. CAD from ryegrass is considered to be the fifth "special technical feature". 

6. Claims 1, 6 to 11, 16, 17, 19b, and 23 to 26 (partial) are to CAD from fescue, the nucleotide sequence encoding 
it, the promoter from its gene and various uses of them. CAD from fescue is considered to be the sixth "special 
technical feature". 



7. Claims 17, 18, 19b and 23 to 26 (partial) and claim 20b (complete) are to caffeic acid O-methyltransferase 
(OMT) gene promoter from ryegrass and various uses of it. Caffeic acid OMT from ryegrass is considered to 
be the seventh "special technical feature". 

8. Claims 17, 18, 19b and 23 to 26 (partial) are to caffeic acid O-methyltransferase (OMT) gene promoter from 
fescue and various uses of it. Caffeic acid OMT from fescue is considered to be the eighth "special technical 
feature" . 

In order for there to be unity between the four types of enzymes claimed, they have to share a significant structural 
element, that is a structural element that defines the specific biological activity of the enzymes, and the significant 
structural element must be disclosed in the specification. No significant structural element has been identified as being 
shared by the four types of enzymes, hence there is lack of unity between the enzymes. In addition, all four types of 
enzymes are known in the prior art, for example, in US 5 952 486. Hence unity of invention is also lacking between the 
enzymes from ryegrass and the enzymes from fescue. 



Note that the application as filed contains two claims numbered 19 and two claims numbered 20. To distinguish them, 
19a and 20a refer to the first claims 19 and 20, and 1 9b and 20b refer to the second claims 19 and 20. 
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